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Vili Preface 


Because of this accident of history, physicists have ever since been trying to 
work backward to fathom the physical principles and symmetries that underlie 
the theory. Unlike Einstein’s theory of general relativity, which began with a 
geometric principle, the equivalence principle, from which the action could 
be derived, the fundamental physical and geometric principles that lie at the 
foundation of superstring theory are still unknown. 

To reduce the amount of hand-waving and confusion this has caused, three 
themes have been stressed throughout this book. To provide the student with 
a solid foundation in superstring theory, we have first stressed the method of 
Feynman path integrals, which provides by far the most powerful formalism in 
which to discuss the model. Path integrals have become an indispensable tool 
for theoretical physicists, especially when quantizing gauge theories. There- 
fore, we have devoted Chapter | of this book to introducing the student to the 
methods of path integrals for point particles. 

The second theme of this book is the method of second quantization. A|- 
though traditionally field theory is formulated as a second quantized theory, 
the bulk of superstring theory is formulated as a first quantized theory, pre- 
senting numerous conceptual problems for the beginner. Unlike the method of 
second quantization, where all the rules can be derived from a single action, 
the method of first quantization must be supplemented with numerous other 
rules and conventions. The hope is that the second quantized theory will reveal 
the underlying geometry on which the entire model is based. 

The third theme of this book is duality, which has revolutionized the way 
we formulate string theory. Duality, which was first discovered in Maxwell’s 
equations, allows us to determine the equivalence of two seemingly different 
theories. Using duality, we can show that the five different superstring theories 
in 10 dimensions are actually unified into a single theory in 11 dimensions, 
a still mysterious theory called M-theory, which reduces to | 1-dimensional 
supergravity in the low-energy limit. Since duality allows us to show the equiv- 
alence of a weak coupling theory to a strong coupling theory, it has allowed us 
to probe, for the first time, the nonperturbative region of string theory, where 
we find a host of new objects, such as membranes, M-branes, and D-branes. 
Although the complete action of M-theory is still unknown, already it has 
yielded a flood of new information concerning the strong coupling behavior 
of string theory. 

We now know that strings coexist with membranes of various dimensions; 
ultimately, the entire theory may be formulated in terms of a master theory in 1 1 
dimensions, perhaps in terms of membranes of some sort. Ironically, although 
the action for M-theory is not known, physicists believe that the theory exists 
because of the vast web of consistency checks given by duality. (However, 
because M-theory itself is such an obscure theory, we will still refer to the 
theory by the name superstring theory.) 

In addition to providing the student with a firm foundation in path integrals 
and field theory, the other purpose of this book is to introduce students to the 
latest developments in superstring theory, that is, to acquaint them with the 
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fast-paced areas that are currently the most active in theoretical research, such 
as: 


String field theory; 

Conformal field theory; 

Kac—Moody algebras; 

Multiloop amplitudes and Teichmiiller spaces; 
Calabi—Yau phenomenology; and 

Orbifolds and four-dimensional superstrings. 


The goal of this book is to provide students with an overview by which 
to evaluate the research areas of string theory and perhaps even engage in 
original research. The only prerequisite for this book is a familiarity with ad- 
vanced quantum mechanics. However, the mathematics of superstring theory 
has soared to dizzying heights. In order to provide an introduction to more 
advanced mathematical concepts, such as Lie groups, general relativity, super- 
symmetry, and supergravity, we have included a short introduction to them in 
the Appendix, which we hope will fill the gaps that may exist in the students’ 
preparation. 

For the student, we should mention how to approach this book. Chapters 1—S 
represent Part I, the results of first quantization. They form an essential foun- 
dation for the next chapters and cannot be skipped. Chapter |, however, may 
be skipped by one who is relatively fluent in the methods of ordinary quan- 
tum field theory, such as gauge invariance and Faddeev—Popov quantization. 
(But we emphasize that the method of path integrals forms the foundation 
for this book, and hence even an advanced student may profit from reviewing 
Chapter 1.) 

Chapters 2 and 3 form the heart of an elementary introduction to string 
theory. Chapter 4, however, can be omitted by one who only wants an overview 
of string theory. With the exception of the fermion vertex function and ghosts, 
most of the results of string theory can be developed using Chapters 2 and 3 
without conformal field theory, and hence a beginner may overlook this chapter. 
(However, we emphasize that most modern approaches to first quantized string 
theory use the results of conformal field theory because it is the most versatile. 
A serious student of string theory, therefore, should be thoroughly familiar 
with the results of Chapter 4.) 

Chapter 5 is essential to understand the miraculous cancellation of diver- 
gences of the theory, which separates string theory from all other field theories. 
Because the theory of automorphic functions gets increasingly difficult as one 
describes multiloop amplitudes, the beginner may skip the discussion of higher 
loops. The serious student, though, will find that multiloop amplitudes form 
an area of active research. 

Part II begins a discussion of the field theory of strings, and Part II examines 
phenomenology. The order of these two parts can be interchanged without 
difficulty. Each part was written to be relatively independent of the other, so 
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the more phenomenologically inclined student may skip directly to Part IT] 
without suffering any loss. 

Chapters 6 and 7 in Part II present the evolution of several approaches to 
string theory. Chapter 6 discusses the original light cone theory and how to 
quantize multiloop theories based on strings. However, Chapter 7 was written 
in a relatively self-contained fashion, so the serious student may skip Chapter 6 
and delve directly into the covariant theory. 

In Part III, the beginner may skip Chapter 8. The discussion of anomalies is 
rather technical and mainly based on point particles, and overlaps the discus- 
sions found in other books. Chapter 9 cannot be omitted, as it represents one of 
the most promising of the various superstring theories. Likewise, Chapter 10 
forms an essential part of our understanding of how the superstring theory may 
eventually make contact with experimental data. 

In Part IV, we finally have a presentation of duality and M-theory. In Chapter 
11, we have a basic presentation of M-theory, and how the five known super- 
stings can all be expressed in terms of single theory. In Chapter 12, we explore 
the duality relations that exist in D = 8, 6 and 4 dimensions, giving us, for the 
first time, a look at the nonperturbative region of string theory. In Chapter 13. 
we explore more advanced topics, such as D-brane physics, matrix models, 
and applications of D-branes to black hole physics. 

The author hopes that this will help both the beginner and the more advanced 
student to decide how to approach this book. 


Michio Kaku 
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First Quantization 
and Path Integrals 


CHAPTER 1 


Path Integrals and 
Point Particles 


1.1 Why Strings? 


One of the greatest scientific challenges of our time is the struggle to unite the 
two fundamental theories of modern physics, quantum field theory and gen- 
eral relativity, into one theoretical framework. Remarkably, these two theories 
together embody the sum total of all human knowledge concerning the most 
fundamental forces of Nature. Quantum field theory, for example, has had 
phenomenal success in explaining the physics of the microcosm, down to dis- 
tances less than 10~'° cm. General relativity, on the other hand, is unrivaled in 
explaining the large-scale behavior of the cosmos, providing a fascinating and 
compelling description of the origin of the Universe itself. The astonishing suc- 
cess of these two theories is that together they can explain the behavior of matter 
and energy over a staggering 40 orders of magnitude, from the subnuclear to 
the cosmic domain. 

The great mystery of the past five decades, however, has been the total 
incompatibility of these two theories. It’s as 1f Nature had two minds, each 
working independently of the other in its own particular domain, operating 
in total isolation of the other. Why should Nature, at its deepest and most 
fundamental level, require two totally distinct frameworks, with two sets of 
mathematics, two sets of assumptions, and two sets of physical principles? 

Ideally, we would want a unified theory to unite these two fundamental 
theories: 

Quantum field theory 


Unified field theory. 
General relativity pa emma 


However, the history of attempts over the past decades to unite these two 
theories has been dismal. They have inevitably been riddled with infinities or 
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have violated some of the cherished principles of physics, such as causality. 
The powerful techniques of renormalization theory developed in quantum field 
theory over the past decades have failed to eliminate the infinities of quantum 
gravity. Apparently, a fundamental piece of the jigsaw puzzle is still missing. 

Although quantum field theory and general relativity seem totally incom- 
patible, the past two decades of intense theoretical research have made it 
increasingly clear that the secret to this mystery most likely lies in the power 
of gauge symmetry. One of the most remarkable features of Nature is that its 
basic laws have great unity and symmetry when expressed in terms of group 
theory. Unification through gauge symmetry, apparently, is one of the great 
lessons of physics. In particular, the use of local symmetries in Yang—Mills 
theories has had enormous success in banishing the infinities of quantum field 
theory and in unifying the laws of elementary particle physics into an elegant 
and comprehensive framework. Nature, it seems, does not simply incorporate 
symmetry into physical laws for aesthetic reasons. Nature demands symmetry. 

The problem has been, however, that even the powerful gauge symmetries 
of Yang—Mills theory and the general covariance of Einstein’s equations are 
insufficient to yield a finite quantum theory of gravity. 

At present, the most promising hope for a truly unified and finite description 
of these two fundamental theories is superstring theory and its latest formu- 
lation, M-theory. [1—12]. Superstrings possess by far the largest set of gauge 
symmetries ever found in physics, perhaps even large enough to eliminate all 
divergences of quantum gravity. Not only does the superstring’s symmetry in- 
clude that of Einstein’s theory of general relativity and the Yang—Mills theory, 
it also includes supergravity and the Grand Unified Theories (GUTs) [13] as 
subsets. 

Roughly speaking the way in which superstring theory solves the riddle of 
infinities can be visualized as in Fig. 1.1, where we calculate the scattering 
of two point particles by summing over an infinite set of Feynman diagrams 
with loops. These diagrams, in general, have similarities that correspond to 
“pinching” one of the internal lines until the topology of the graph is altered. 


FIGURE 1.1. Single-loop Feynman diagram for four-particle scattering. The ultra- 
violet divergence of this diagram corresponds to the pinching of one internal leg, 
1.e., when one internal line shrinks to a point. 
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FIGURE 1.2. Two-loop Feynman diagram for closed string scattering. The diagram 
is ultraviolet finite because it cannot be pinched as in the point particle case. From 
topological arguments alone, we can see that string theory is less divergent than point 
particle theory. Infrared divergences, however, may still exist. 


By contrast, in Fig. 1.2 we have the single-loop contribution to the scattering 
of two closed string states. Notice that we cannot “pinch” one of the internal 
lines as in the point particle case. Thus, we naively expect that the superstring 
theory is less divergent or even finite because of the symmetries that forbid 
this topological deformation. 

Any theory that can simultaneously eliminate the infinities of the S-matrix 
and incorporate quantum mechanics, the general theory of relativity, GUT 
theory, and supergravity obviously possesses mathematics of breathtaking 
beauty and complexity. In fact, even the mathematicians have been startled 
at the mathematics emerging from the superstring theory, which links together 
some of the most dissimilar, far-ranging fields of mathematics, such as Kac— 
Moody algebras, Riemann surfaces and Teichmiiller spaces, modular groups, 
and even Monster group theory. . 

The great irony of string theory, however, is that the theory itself is not 
unified. To someone learning the theory for the first time, it is often a frustrating 
collection of folklore, rules of thumb, and intuition. At times, there seems to 
be no rhyme or reason for many of the conventions of the model. For a theory 
that makes the claim of providing a unifying framework for all physical laws, 
it is the supreme irony that the theory itself appears so disunited! The secrets 
of the model, at its most fundamental level, are still being pried loose. 

Usually, when we write down a quantum theory, we start with the geometry 
or symmetry of the theory and then write down the action. From the action, in 
turn, we derive all the predictions of the model, including the unitary S-matrix. 
Thus, a second quantized action is the proper way in which to formulate a 
quantum field theory. The fundamental reason why superstring theory seems, 
at times, to be a loose collection of apparently random conventions is that it is 
usually formulated as a first quantized theory. Because of this, we must appeal 
to intuition and folklore in order to construct all the Feynman diagrams for 
a unitary theory. We hope that M-theory (or perhaps evern a more advanced 
theory) will be able to unify superstring theory into a simple, coherent formal- 
ism. Already, M-theory can unify the five different superstring theories into 
a single theory. Ultimately, there may be a single equation (perhaps no more 
than an inch long) which will unify the entire theory. 
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Unfortunately, the geometry of the superstring and membranes are some 
of the last features of the model to be developed. In fact, as seen from this 
perspective, the model has been developing backward for the past 30 years, 
beginning with the accidental discovery of its quantum theory in 1968! 

By contrast, when Einstein first discovered general relativity, he started with 
physical principles, such as the equivalence principle, and formulated it in the 
language of general covariance. Once the geometry was established, he then 
wrote down the action as the unique solution to the probiem. Later, classical 
solutions to the equations were discovered in terms of curved manifolds, which 
provided the first successful theoretical models for the large-scale behavior of 
the Universe. Finally, the last step in the evolution of general relativity is the 
development of a quantum theory of gravity. The crucial steps in the historical 
evolution of general relativity can thus be represented as 


Geometry — Action > Classical theory — Quantum theory. 


Furthermore, both general relativity and Yang—Mills theory are mature 
theories: they both can be formulated from first principles. which stresses 
the geometry and the physical assumptions underlying the theory. Superstring 
theory and M-theory are just beginning to reach that stage of development. 

Remarkably, Yang—Mills theory and gravity theory are the waique solution 
to two simple geometric statements: 


(1) Global Symmetry 
The free theory must propagate pure ghost-free spin-| and spin-2 fields 
transforming as irreducible representations of SU(N) and the Lorentz 
group. 

(2) Local Symmetry 
The theory must be locally SU(N) and generally covariant. 


What is remarkable is that the coupled Yang-Mills gravity action is the 
unique solution of these two simple principles 


J l v 
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(The first principle contains the real physics of the theory. It cannot be 
included as a subset of the second principle. There is an infinite number of 
generally covariant and SU(N) symmetric invariants, so we need the first prin- 
ciple to input the physics and select the irreducible representations of the 
basic fields. By “pure” fields, we mean ghost-tree fields that have at most two 
derivatives, which rules out R° and F* higher derivatives theories.) 

The question remains: What is the counterpart to these nvo simple principles 
for superstring theory and M-theory? 

The plan of this book, of course, must reflect the fact that the theory has 
been evolving backward. For pedagogical reasons, we will mostly follow the 
historical development of the theory. Thus, Part I of the book, which introduces 
the first quantized theory, will at times appear to be a loose collection of 
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conventions without any guiding principle. That is why we have chosen, in Part 
1, to emphasize the path integral or functional approach to string theory. Only 
with Feynman path integrals do we have a formalism in which we can derive 
the other formalisms, such as the harmonic oscillator formalism. Although the 
path integral formulation of a first quantized theory is still woefully inadequate, 
compared to a genuine second quantized theory, it is the most convenient 
formalism in which to tie together the loose ends of the first quantized theory. 

In Part II of the book we will discuss the field theory itself, from which we 
can derive all the results of the theory from one action. However, once again 
we have followed historical order and presented the field theory backward. 

In Part III we present the “phenomenology” of strings. Although it may be 
presumptuous to do phenomenology starting at 10!? GeV, it is important to 
establish the kinds of predictions that the theory makes. 

Finally, in Part IV, we present duality and 1 1-dimensional M-theory, which 
allows us, for the first time, to probe the nonperturbative region of string theory, 
where the theory coexists with exotic objects like p-branes and D-branes. 

However, to really appreciate the successes and possible defects of the su- 
perstring theory, we must first try to understand the historical problems that 
have plagued physicists for the past five decades. Let us now turn to a quick 
review of the development of gauge theories in order to appreciate the diffi- 
culty of constructing a finite theory of gravity. We will also briefly sketch the 
historical development of the superstring theory. 


1.2 Historical Review of Gauge Theory 


In the 1960s, elementary particle physics seemed hopelessly mired in confu- 
sion. The weak, electromagnetic, strong, and gravitational forces were each 
studied seperately, largely in isolation of the others. Moreover, investigations 
into each force had reached a fundamental roadblock: 


(1) The weak interactions. Theoretical models of the weak interactions had 
progressed embarrassingly little beyond the Fermi theory first proposed 
three decades earlier in the 1930s: 


Leer i ee Wiel aw. leZe IIs) 


where the [’4 represents various combinations of Dirac matrices. The next 
major step, a theory of W bosons, was plagued with the problem of in- 
finities. Furthermore, no one knew the underlying symmetry among the 
leptons, or whether there was any. 
(2) The strong interactions. In contrast to weak interactions, the Yukawa 
’ meson theory provided a renormalizable theory of the strong interactions: 


Lyukawa ae ew Wo. (22) 
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However, the Yukawa theory could not explain the avalanche of “ele- 
mentary” particles that were being discovered in particle accelerators. 
J. Robert Oppenheimer even suggested that the Nobel Prize in Physics 
should go to the physicist who didn’t discover a particle that year. Fur- 
thermore, the quark model, which seemed to fit data much better than it 
had any right to, was plagued with the fact that quarks were never seen 
experimentally. 

(3) The gravitational force. Gravity research was totally uncoupled from re- 
search in the other interactions. Classical relativists continued to find more 
and more classical solutions in isolation from particle research. Attempts 
to canonically quantize the theory were frustrated by the presence of the 
tremendous redundancy of the theory. There was also the discouraging re- 
alization that even if the theory could be successfully quantized, it would 
still be nonrenormalizable. 


This bleak landscape changed dramatically in the early 1970s with the com- 
ing of the gauge revolution. One of the great achievements of the past 25 years 
has been the development of a fully renormalizable theory of spin-| gauge par- 
ticles in which, for the first time, physicists could actually calculate realistic 
S-matrix elements. Thus, it took over 100 years to advance beyond the original 
gauge theory first proposed by Maxwell in the 1860s! (See the Appendix for 
an elementary introduction to gauge theories and group theory.) 

Apparently the key to eliminating the divergences of relativistic quantum 
mechanics is to go to larger and more sophisticated gauge groups. Symmetry, 
instead of being a purely aesthetic feature of a particular model, now becomes 
its most important feature. 

For example, Maxwell’s equations, which provided the first unification of 
the electric force with the magnetic force, has a gauge group given by U(1). 
The unification of the weak and electromagnetic forces into the electroweak 
force requires SU(2) ® U(1). The forces that bind the quarks together into 
the hadrons, or quantum chromodynamics (QCD), are based on SU(3). All of 
elementary particle physics, in fact, is compatible with the minimal theory of 
SU(3) ® SU(2) ® U(1). 

Although the verdict is still not in on the GUTs, which are supposed to unite 
the electroweak force with the strong force, once again the unifying theme is 
gauge symmetry, with such proposals as SU(5), O(10), ete., symmetry. 

Although the gauge revolution is perhaps one of the most important devel- 
opments in decades, it is still not enough. There is a growing realization that 
the Yang-Mills theory by itself cannot push our understanding of the physical 
universe beyond the present level. Not only do the GUTs fail to explain im- 
portant physical phenomena, but also there is the crucially important problem 
of formulating a quantum theory of gravity. 

Grand Unified Theories, first of all, cannot be the final word on the 
unification of all forces. There are several features of GUTs that are still 
unresolved: 
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(1) GUTs cannot resolve the problem of why there are three nearly exact 
copies or families of elementary particles. We still cannot answer Rabi’s 
question, “Who ordered the muon?” 

(2) GUTs still have 20 or so arbitrary parameters. They cannot, for example, 
calculate the masses of the quarks, or the various Yukawa couplings. A 
truly unified theory should have at most one arbitrary parameter. 

(3) GUTs have difficulty solving the hierarchy problem. Unless we appeal 
to supersymmetry, it is hard to keep the physics of incredibly massive 
particles from mixing with everyday energies and destroying the hierarchy. 

(4) The unification of particle forces occurs around 107-7 cm which is very 
close to the Planck length of 10-7? cm, where we expect gravitational 
effects to become dominant. Yet GUTs say nothing whatsoever about 
gravitation. 

(5) So far, proton decay has not been conclusively observed, which al- 
ready rules out minimal SU(5). There is, therefore, still no compelling 
experimental reason for introducing the theory. 

(6) It is difficult to believe that no new interactions will be found between 
present-day energies and the unification scale. The “desert” may very well 
bloom with new interactions yet unknown. 


The most perplexing and the most challenging of these problems, from 
a foundational point of view, has been to find a way of quantizing Einstein’s 
theory of general relativity. Although Yang—Mills theories have had spectacular 
successes in unifying the known laws of particle physics, the laws of gravity 
are curiously different at a fundamental level. Clearly, Yang—Mills theory and 
conventional gauge theory are incapable of dealing with this problem. Thus, 
GUTs are faced with formidable experimental and theoretical problems when 
pushed to their limits. 

General relativity is also plagued with similar difficulties when pushed to 
its limits: 


(1) Classically, it has been established that Einstein’s equations necessarily 
exhibit pointlike singularities, where we expect the laws of general rela- 
tivity to collapse. Quantum corrections must dominate over the classical 
theory in this domain. 

(2) The action is not bounded from below, because it is linear in the curvature 
tensor. Thus, it may not be stable quantum mechanically. 

(3) General relativity is not renormalizable. Computer calculations, for exam- 
ple, have now conclusively shown that there is a nonzero counterterm in 
Einstein’s theory at the two-loop level. 


Naive attempts to quantize Einstein’s theory of gravitation have met with 
disappointing failure. One of the first to point out that general relativity 
would be incompatible with quantum mechanics was Heisenberg, who noted 
that the presence of a dimensional coupling constant would ruin the usual 
renormalization program. 
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If we set 
h — 
ea 
there still remains a dimensional constani even in the Newtonian theory of 
gravity, the gravitational constant G: 


o= I. (1.2.3) 


a (1.2.4) 
r 

which has dimensions of centimeters squared. When we power expand the 
metric tensor g,,,, around a flat square with the metric n,, =(— + + +), we 

introduce the coupling constant «, which has dimensions of centimeters: 
Suv =p lays (12.59 

Therefore 

G~k’. (1.2.6) 


In this system of units, where the only unit is the centimeter, this coupling 
constant « becomes the Planck Jength, 1077 cm or 10'° GeV, which is far 
beyond the reach of experimentation! 

Renormalization theory, however, is founded on the fundamental premise 
that we can eliminate all divergences with an infinite redefinition of certain 
constants. Having a negative dimensional coupling constant means that this 
complicated reshuffling and resuming of graphs is impossible. Negative cou- 
pling constants mean that we can always insert the interaction term into a 
Feynman diagram and increase its power of divergence. This means that any 
graph can be made arbitrarily divergent by multiple insertions. This means 
that general relativity cannot be a renormalizable theory. The amplitude for 
graviton—graviton scattering, for example, is now a power expansion in a 
dimensional parameter (see Fig. 1.3): 


co 


A= ean, a2 


n=2 


where we are no longer able to shuffle graphs in the usual manner to cancel the 
infinities, which is the heart of renormalization theory. Thus, renormalization 
theory breaks down. 

Because gencral relativity is hopelessly outside the domain of conventional 
renormalization theory, we must reconsider Dirac’s fundamental objection. It 
was Dirac who said that the success of quantum mechanics was based on ap- 
proximation schemes where each correction term was increasingly small. But 
renormalization theory is flawed because it maximally violates this principle 
and manipulates infinite quantities and discards them at the end. 

One solution might be to construct a theory of gravity that is finite to every 
order in the coupling constant, with no need for renormalization at all. For a 
while, one bright hope was supergravity [14, 15], based on the local gauge 
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FIGURE 1.3. Scattering amplitude for graviton-graviton scattering. Because the 
coupling constant has negative dimension any graph can be made arbitrarily diver- 
gent, thereby requiring an infinite number of counterterms. Thus, theories containing 
quantum gravity must be either divergent or completely finite order-by-order. Pure 
quantum gravity has been shown on computer to diverge at the two-loop level. Coun- 
terterms have also been found for quantum gravity coupled to lower-spin particles. 
Thus, superstring theory is the only candidate for a finite theory. 


group Osp(NV/4) (see Appendix), which was the first nontrivial extension of 
Einstein’s equations in 60 years. The hope was that this gauge group would 
offer us a large enough set of Ward—Takahashi identities to cancel a large class 
of divergent diagrams. The larger the gauge group, the more likely troublesome 
infinities would cancel (see Fig. 1.4): 


Theory Gauge group 
Electromagnetism U(1) 
Electroweak SU(2) @ U0) 
Strong SU(3) 

GUT(?) SU(5), O10) 
Gravity(?) GL(4), O(3, 1) 


Supergravity(?) Osp(N /4) 


The basic strategy being pursued was 


“Gauge symmetry — Ward—Takahashi identities 
— Cancellation of graphs —~ Renormalizable theory. 


For example, even Einstein’s theory of gravity can be shown to be triv- 
ially finite at the first loop level. There exists a remarkable identity, called the 
Gauss—Bonnet identity, which immediately shows that all one-loop graphs in 
general relativity (which would take a computer to write down) sum to zero. 
In fact, the super-Gauss—Bonnet identities eliminate many of the divergences 
of supergravity, but probably not enough to make the theory finite. 

The largest and most promising of the supergravities, the O(8) supergravity, 
is probably divergent. Unfortunately, it is possible to write down locally super- 
symmetric counterterms at the seventh loop level. It is highly unlikely that the 
coefficients of this and probably an infinite number of other counterterms can 
all vanish without appealing to an even higher symmetry. This is discourag- 
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Electricity 


U(1) 


Magnetism SU (2) & U(1) 


Weak Force SU (5), O(10) ? 


Strong Force Superstrings ? 


Gravitation 


FIGURE 1.4. Chart showing how gauge theories based on Lie groups have united the 
fundamental forces of Nature. Maxwell’s theory, based on U(1). unites electricity 
and magnetism. The Weinberg—Salam model, based on SU(2) @ U(1), unites the 
weak force with the electromagnetic force. GUTs (based on SU(5), O(10), or larger 
groups) are the best candidate to unite the strong force with the electroweak force. 
Superstring theory is the only candidate for a gauge theory that can unite gravity 
with the rest of the particle forces. 


ing, because it means that the gauge group of the largest supergravity theory. 
Osp(8/4), is still too small to eliminate the divergences of general relativity. 

Furthermore, the O(8) gauge group is too small to accommodate the minimal 
SU(3) ® SU(2) ® U(1) of particle physics. If we go to higher groups beyond 
O(8), we find that we must incorporate higher and higher spins into the theory. 
However, an interacting spin-3 theory is probably not consistent, making one 
suspect that O(8) is the limit to supergravity theories. 

In conclusion, supergravity must be ruled out for two tundamental reasons: 


(1) It is probably not a finite theory because the gauge group 1s not large 
enough to eliminate all possible supersymmetric counterterms. There is a 
possible counterterm at the seventh loop level. 

(2) Its gauge group O(8) is not large enough to accommodate the minimal 
symmetry of particle physics, namely, SU(3) @ SU(2) ® U(1): nor can the 
theory accommodate chiral fermions. 


Physicists, faced with these and other stumbling blocks over the years, have 
concluded that perhaps one or more of our cherished assumptions about our 
Universe must be abandoned. Because general relativity and quantum me- 
chanics can be derived from a small set of postulates, one or more of these 
postulates must be wrong. The key must be to drop one of our commonsense 
assumptions about Nature on which we have constructed general relativity and 
quantum mechanics. Over the years, several proposals have been made to drop 
some of our commonsense notions about the Universe: 


(1) Continuity 
This approach assumes that spacetime must be granular. The size of these 
grains would provide a natural cutoff for the Feynman integrals, allowing 


(2) 


(3) 
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us to have a finite S-matrix. Integrals like 


/ d*x AQ) 


would then diverge as e~”, but we would never take the limit as ¢ goes to 
zero. Lattice gravity theories are of this type. In Regge calculus [16], for 
example, we latticize Riemannian space with discrete four-simplexes and 
replace the curvature tensor by the angular deficit calculated when moving 
in a circle around a simplex: 


1 
—3 av 8k —» angular deficit. 


(In flat space, there is no angular deficit when walking around a closed 
path, and the action collapses.) Usually, in lattice theories, we take the 
limit as the lattice length goes to zero. Here, however, we keep it fixed at a 
small number [17]. At present, however, there is no experimental evidence 
to support the idea that space-time is granular. Although we can never 
rule out this approach, it seems to run counter to the natural progression 
of particle physics, which has been to postulate larger and more elegant 
groups. 

Causality 

This approach allows small violations in causality. Theories that incor- 
porate the Lee—Wick mechanism [18] are actually renormalizable, but 
permit small deviations from causality. These theories make the Feynman 
diagrams converge by adding a fictitious Pauli—Villars field of mass M that 
changes the ultraviolet behavior of the propagator. Usually, the Feynman 
propagator converges as p~ in the ultraviolet limit. However, by adding 
a fictitious particle, we can make the propagator converge even faster, like 


pm 


d i 
ie see (1.2.9) 


ptm p?+ M? ps 
Notice that the Pauli—Villars field is a ghost because of the —| that appears 
in the propagator. (This means that the theory will be riddled with negative 
probabilities.) Usually, we let the mass of the Pauli—Villars field tend to 
infinity. However, here we keep it finite, letting the pole go out onto the 
unphysical sheet. Investigations of the structure of the resulting Feynman 
diagrams show, however, that causality is violated; that is, you can meet 
your parents before you are born. 
Unitarity 
We can replace Einstein’s theory, which is based on the curvature tensor, 
with a conformal theory based on the Weyl tensor: 


eRe > 6) EC oe (1.2.10) 
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where the Wey] tensor is defined as 
Curpo = Ruvoo + 8uto Row + 8v{o Row + a ReupSalv (1.2.11) 


where the brackets represent antisymmetrization. The conformal tensor 
possesses a larger symmetry group than the curvature tensor, that is, 
invariance under local conformal transformations: 


Buy > &Buw (1.2.12) 


Bee mae Coe: 

The Weyl theory converges because the propagators go as p “; that is, it 
is a higher derivative theory. However, there is a “unitary ghost” that also 
appears with a — | in the propagator, for the same reasons cited above. The 
most optimistic scenario would be to have these unitary ghosts “confined” 
by a mechanism similar to quark confinement [19, 20]. 

(4) Locality 
Over the years, there have also been proposals to abandon some of the 
important postulates of quantum mechanics, such as locality. After all, 
there is no guarantee that the laws of quantum mechanics should hold 
down to distances of 10733 em. However, there have always been problems 
whenever physicists tried to deviate from the laws of quantum mechanics, 
such as causality. At present, there is no successful alternative to quantum 
mechanics. 

(5) Point Particles 
Finally, there is the approach of superstrings, which abandons the concept 
of idealized point particles, first introduced 2000 years ago by the Greeks. 


The superstring theory, because it abandons only the assumption that the fun- 
damental constituents of matter must be point particles. does the least amount 
of damage to cherished physical principles and continues the tradition of in- 
creasing the complexity and sophistication of the gauge group. Superstring 
theory does not violate any of the laws of quantum mechanics, yet manages 
to eliminate most, if not all, of the divergences of the Feynman diagrams. 
The symmetry group of the superstring model. the largest ever encountered 
in the history of physics, is probably large enough to make the theory finite 
to all orders. Once again, it is symmetry, and not the breakdown of quantum 
mechanics, that is the fundamental key to rendering a theory finite. 

In Fig. 1.5 we sce diagrammatically the evolution of various theories of 
gravity. First, there was Newton's theory of action at a distance, where gravi- 
tational interactions travel faster than the speed of light. Einstein replaced this 
with the classical interpretation of curved manifolds. Quantum gravity, in turn, 
makes quantum corrections to Einstein's theory by adding in loops. Finally. 
the superstring theory makes further corrections to the point particle quantum 
theory by summing over all possible topological configurations of interacting 
strings. 
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Newton Einstein Quantum Gravity 


— ————— ee dao 6 


Superstrings 


FIGURE 1.5. Steps in the evolution of the theory of gravitation. Each step in this 
chart builds on the successes of the previous step. Newton thought gravity was a 
force that acted instantly over a distance. Einstein proposed that gravitation was 
caused by the curvature of space-time. The naive merger of general relativity and 
quantum mechanics produces a divergent theory, quantum gravity, which assumes 
that gravitation is caused by the exchange of particle-like gravitons. Superstring 
theory proposes that gravitation is caused by the exchange of closed strings. 


Superstring theory, however, is quite unlike its predecessors in its historical 
development. Unlike other physical theories, superstring theory has perhaps 
one of the strangest histories in science, with more twists and turns than a roller 
coaster. 

First, two young physicists Veneziano and Suzuki [21, 22], independently 
discovered its quantum theory when they were thumbing through a math- 
ematics book and accidentally noted that the Euler Beta function satisfied 
all the postulates of the S-matrix for hadronic interactions (except unitarity). 
Neveu, Schwarz, and Ramond [23-25] quickly generalized the theory to in- 
clude spinning particles. To solve the problem of unitarity, Kikkawa, Sakita, 
and Virasoro [26] proposed that the Euler Beta function be treated as the Born 
term to a perturbation series. Finally, Kaku, Yu, Lovelace, and Alessandrini 
[27-33] completed the quantum theory by calculating bosonic multiloop dia- 
grams. The theory, however, was still formulated entirely in terms of on-shell 
S-matrix amplitudes. 

Next, Nambu and Goto [34, 35] realized that lurking behind these scattering 
amplitudes was a classical relativistic string. In one sweep, they revolution- 
ized the entire theory by revealing the unifying, classical picture behind the 
theory. The relationship between the classical theory and the quantum theory 
was quickly made by Goldstone, Goddard, Rebbi, and Thorn [36] and further 
developed by Mandelstam [37]. The theory, however, was still formulated as a 
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first quantized theory, so that the measure, the vertices, the counting of graphs, 
etc., all had to be postulated ad hoc and not deduced from first principles. 

The action (in particular gauge) was finally written down by Kaku and 
Kikkawa [38]. At last, the model could be derived from one action strictly in 
terms of physical variables, although the action did not have any symmetries 
left. However, when it was discovered that the theory was defined only in 10 and 
26 dimensions, the model quickly died. Furthermore, the rapid development 
of QCD as a theory of hadronic interactions seemingly put the last nail in the 
coffin of the superstring. 

For 10 years, the model languished because no one could believe that a 
10- or 26-dimensional theory had any relevance to four-dimensional physics. 
When Scherk and Schwarz [39] made the outrageous (for its time) suggestion 
that the dual model was actually a theory of all known interactions, no one 
took the idea very seriously. The idea fell like a lead balloon. 

The discovery in 1984 by Green and Schwarz [40] that the superstring theory 
is anomaly-free and probably finite to all orders in perturbation theory has 
revived the theory. The E; ® E3 “heterotic string” of Gross, Harvey, Martinec, 
and Rohm [41] was considered to be the best candidate for unifying gravity 
with physically reasonable models of particle interactions. 

One of the active areas of research now is to complete the evolution of the 
theory, to discover why all the “miracles” occur in the model. There has been 
a flurry of activity in the direction of writing down the covariant action using 
methods discovered in the intervening 10 years, such as BRST. 

Finally, intensive work on duality in the early 1990s, and especially the work 
of Witten and Townsend in1995, established M-theory as the latest, most ad- 
vanced formulation of string theory. The five superstring theories could now be 
unified into a single theory in 11 dimensions, perhaps in terms of membranes. 
Furthermore, the vast web of dual relationships in lower dimensions has given 
us a “road map” by which to probe nonperturbative solutions to string theory. 
What is still missing, however, is the geometry on which the theory ultimately 
lies. Understanding this geometry may allow us to formulate the entire theory 
into a simple, coherent equation. This could complete the evolution of the 
theory, which has been evolving backward for the past 30 years. 


Quantum theory — Classical theory — Action + Geometry. 


Let us summarize some of the promising positive features of the superstring 
model: 


(1) The gauge group includes Ex ® Eg which is much larger than the minimal 
group SU(3)@ SU(2) @ U(1). There is plenty of room for phenomenology 
in this theory. 

(2) The theory has no anomalies. These small but important defects in a quan- 
tum field theory place enormous restrictions on what kinds of theories are 
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self-consistent. The symmetries of the superstring theory, by a series of 
“miracles,” can cancel all of its potential anomalies. 

(3) Powerful arguments from the theory of Riemann surfaces indicate that the 
theory is finite to all orders in perturbation theory (although a rigorous 
proof is still lacking). 

(4) There is very little freedom to play with. Superstring models are no- 
toriously difficult to tinker with without destroying their miraculous 
properties. Thus, we do not have the problem of 19 arbitrary coupling 
constants. 

(5) The theory includes GUTs, super-Yang—Mills, supergravity, and Kaluza— 
Klein theories as subsets. Thus, many of the features of the phenomenology 
developed for these theories carry over into the string theory. 


Superstring theory, crudely speaking, unites the various forces and particles 
in the same way that a violin string provides a unifying description of the 
musical tones. By themselves, the notes A, B, C, etc., are not fundamental. 
However, the violin string is fundamental; one physical object can explain 
the varieties of musical notes and even the harmonies we can construct from 
them. In much the same way, the superstring provides a unifying description of 
elementary particles and forces. In fact, the “music” created by the superstring 
is the forces and particles of Nature. 

Although superstring theory, because of its fabulously large set of symme- 
tries, has ‘“‘miraculous” cancellations of anomalies and divergences, we must 
also present a balanced picture and point out its shortcomings. To be fair we 
must also list the potential problems of the theory that have been pointed out 
by critics of the model: 


(1) It is impossible experimentally to reach the tremendous energies found 
at the Planck scale. Therefore, the theory is in some sense untestable. A 
theory that is untestable is not an acceptable physical theory. 

(2) Not one shred of experimental evidence has been found to confirm the 
existence of supersymmetry, let alone superstrings. 

(3) It is presumptuous to assume that there will be no surprises in the “desert” 
between 100 and 10!° GeV. New, totally unexpected phenomena have al- 
ways cropped up when we have pushed the energy scale of our accelerators. 
Superstring theory, however, makes predictions over the next 17 orders of 
magnitude, which is unheard of in the history of science. 

(4) The theory does not explain why the cosmological constant is zero. Any 
theory that claims to be a “theory of everything” must surely explain 
the puzzle of a vanishing cosmological constant, but it is not clear how 
superstrings solve this problem. 

(5) The theory has an embarrassment of riches. There are apparently millions 
of ways to break down the theory to low energies. Which is the correct 
vacuum? Although the superstring theory can produce the minimal theory 
of SU(3) ® SU(2) @ U()), it also predicts many other interactions that have 
not yet been seen. 
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(6) No one really knows how to break a 10-dimensional theory down to four 
dimensions. 


Of these six objections to the model, the most fundamental is the last, the 
inability to calculate dimensional breaking. The reason for this is simple: to ev- 
ery order in perturbation theory, the dimension of space-time is stable. Thus, in 
order to have the theory spontaneously curl up into four- and six-dimensional 
universes, we must appeal to nonperturbative, dynamical effects, which are 
notoriously difficult to calculate. This is why the search for the geometry under- 
lying the theory is so important. The geometric formulation of the model may 
give us the key insight into the model that will allow us to make nonperturbative 
calculations and make definite predictions with the theory. 

Thus, the criticism that the model cannot be tested at the Planck length is 
actually slightly deceptive. The superstring theory, if it could be successfully 
broken dynamically, should be able to make predictions down to the level of 
everyday energies. For example, it should be able to predict the masses of the 
quarks. Therefore, we do not have to wait for several centuries until we have 
accelerators that can reach the Planck length. 

Thus, the fundamental problem facing superstrings is not necessarily an 
experimental one. It is mainly theoretical. The outstanding problem of the 
theory is to calculate dynamical symmetry breaking, so that its predictions 
can be compared with experimental data at ordinary energies. 

A fundamental theory at Planck energies is also a fundamental theory at 
ordinary energies. Thus, the main stumbling block to the development of the 
theory is an understanding of its nonperturbative behavior. 

In Part I of this book, however, we will follow historical precedent and 
present the first quantized formulation of the model. As we will stress through- 
out this book, the first quantized theory seems to be a loose collection of random 
facts. As a consequence, we have emphasized the path integral formulation 
(first written down for the Veneziano model by Hsue, Sakita. and Virasoro [42. 
43]) as the most powerful method of formulating the first quantized theory. 
Although the path integral approach cannot reveal the underlying geometric 
formulation of the modcl, it provides the most comprehensive formulation of 
the first quantized theory. 

We will not turn to the functional formulation [44] of point particle theory, 
which can be incorporated almost directly into the string theory. 


1.3. Path Integrals and Point Particles 


Let us begin our discussion by analyzing the simplest of all possible systems, 
the classical nonrelativistic point particle. Surprisingly, much of the analysis of 
this simple dynamical system carries over directly to the superstring theory. The 
language we will use is the formalism of path integrals, which is so versatile that 
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it can accommodate both first quantized point particles and second quantized 
gauge fields with equal ease. 

As in classical mechanics, the starting point is the Lagrangian for a point 
particle: 


L = mx? — V(x), c3.1) 


where the particle is moving in an external potential. The real physics is con- 
tained in the statement that the action § must be minimized. The equations of 
motion can be derived by minimizing the action: 


S = f H0, 3, t)dt, 
58 = 0. (1.3.2) 


To calculate the equations of motion, let us make a small variation in the path 
of the particle given by 


oxi, bX;. (153.3) 
Under this small variation, the action varies as follows: 
b6L 6L 
fa — dx; + —5x;} = 0. (1.3.4) 
dX; bX; 


Integrating by parts, we arrive at the Euler-Lagrange equations: 
— —-—— =0. (35) 


For our point particle, the equations of motion become 


d*x; aV(x) 
m = 
dt? OX; 
which correspond to the usual classical Newtonian equations of motion. 
In addition to the Lagrangian formulation of classical mechanics, there is 
also the Hamiltonian form. Instead of introducing the position and the velocities 
as fundamental objects, we now introduce the position and the momentum: 


bL 


(1.3.6) 


Ss 1.3.7) 
P bX; ( 
With this definition of the conjugate variable, we have 
H = p;x; — L, 
p? 
H(p;, x;) = =— + V(x). (1.3.8) 
2m 


Finally, the Poisson brackets between the momenta and the coordinates are 
given by 


[pi. Xj]pp = —45ij- (1.3.9) 
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A celebrated theorem in classical mechanics states that the equations of 
motion of Newton and the action principle method can be shown to be identical. 
Beginning with the action principle, we can derive Newton’s laws of motion, 
and vice versa 


Equations of motion <> Action principle. 


This equivalence, however, breaks down at the quantum level. Quantum 
mechanically, there is a fundamental difference between the two, with the 
equations of motion being only an approximation to the actual quantum be- 
havior of matter. Thus, the action principle is the only acceptable framework 
for quantum mechanics. 

Let us now reformulate the principles of quantum mechanics in terms of 
Feynman path integrals [44]: 


(1) The probability P(a, b) of a particle moving from point a to point b is the 
square of the absolute value of a complex number, the transition function 
Kala: 


P(a, b) = |K(a, b)|’. (1.3.10) 


(2) The transition function is given by the sum of a certain phase factor, which 
is a function of the action S, taken over all possible paths from a to b: 


Kia, by) =e (1.3.11) 


paths 


where the constant k can be fixed by 


K(a,c) = 9° K(a, b)K(b, ©), (1.3.12) 


paths 


and the intermediate sum is taken over paths that go through all possible 
intermediate points b. 


The second principle says that a particle “sniffs out” all possible paths from 
point a to point b, no matter how complicated the paths may be. We calculate 
this phase factor for each of this infinite number of paths. Then the transition 
factor for the path between a and / is calculated by summing over all possible 
phase factors (see Fig. 1.6). 

Remarkably, the essence of quantum mechanics is captured in these thvo 
principles. All the profoundly important implications of quantum mechanics, 
which represent a startling departure from classical mechanics, can be derived 
from these two innocent-sounding principles! In particular, these two princi- 
ples summarize the essence of the quantum interpretation of the double-slit 
experiment, which, in turn, summarizes the essence of quantum mechanics 
itself. 

It is apparent at this point that the results of classical mechanics can be 
reproduced from our two assumptions in a certain approximation. Notice that, 
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Classical Quantum 
Mechanics Mechanics 


FIGURE 1.6. The essential difference between classical mechanics and quantum 
mechanics. Classical mechanics assumes that a particle executes just one path be- 
tween two points based either on the equations of motion or on the minimization of 
the action. By contrast, quantum mechanics sums the contributions of probability 
functions (based on an action) for all possible paths between two points. Although 
the classical path is the one most favored, in principle all possible paths contribute to 
the path integral. Thus, the action principle is more fundamental than the equations 
of motion at the quantum level. 


for values of S that are large compared to Planck’s constant, the phase factor 
fluctuates rapidly, canceling out these contributions: 


h ; i2xS/h 
8S > 5: De >0. | (1.3.13) 


paths 


Thus, the only contributions to the path integral that survive are those for which 
the deviations in the action from the classical path are on the order of Planck’s 
constant: 


oe (1.3.14) 
. 20 
We see that the Euler-Lagrange equations of motion are reproduced only ina 
certain classical limit, that is, when Planck’s constant goes to zero. Therefore 
the size of Planck’s constant ultimately determines the probability that a par- 
ticle will execute trajectories that are forbidden classically. We see the origin 
of Heisenberg’s uncertainty principle embodied in these two principles. 
Now let us try to reformulate this more precisely in terms of path integrals. 
The second principle now reads 


b 
K(a, b) = i Dre (1.3.15) 


where 


K(a, a= | Ka.HKe, c)Dx, (1.3.16) 
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and 


SN 
Pe = [x= lim f | [|] ean. (1.3.17) 


paths cad f=lpn—| 


where the index n labels N intermediate points that divide the interval between 
the initial and the final coordinate. We will take the limit when NV approaches 
infinity. 

It is absolutely essential to understand that the integration Dx is not the 
ordinary integration over x. In fact, it is the product of all possible integrations 
over all intermediate points x; ,, between points a and b. This crucial difference 
between ordinary integration and functional integration goes to the heart of the 
path integral formalism. 

This infinite series of integrations, in turn, is equivalent to summing over all 
possible paths between a and b. Thus, we will have to be careful to include 
normalization factors when performing an integration over an infinite number 
of intermediate points. 

If we take the simple case where L = 4mX-?, all functional integrations can 
actually be performed exactly. The integral in question is a Gaussian, which is 
fortunately one of the small number of functional integrals that can actually be 
performed. One of the great embarrassments of the method of path integrals 
is that one of the few integrals that can actually be performed is 


a 2 nee 
/ TIES a aa _ ieee (1.3.18) 


2n+1 
oe) ae 


We will be using this formula throughout the entire book. 

Let us now break up the path tnto an infinite number of intermediate points, 
Xin. (Notice that the functional expression integrates over all possible values 
of the intermediate point .x;.,,, so we cannot expect that x,,, and x,,,.; are close 
to each other even for small time separations.) Let us write 


adi é 
le? i el | 1 
sMX} sIM(Xn —XasiE - (1.3.19) 
In order to perform the functional integral over an infinite number of 
intermediate points, we will repeatedly use the following Gaussian integration: 


ik dx exp[—a(x, — x2)’ — a(x. — x3)"] 


= 5 exp [-fa(xs — 25]. (1.3.20) 


One of the crucial points to observe here is that the integration over a Gaussian 
in one of the intermediate points yields another Gaussian with that interme- 
diate point removed. This is the fundamental reason why we can perform the 
functional integration over an infinite number of intermediate points. 
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Finally, the path integral that we wish to perform is given by 


(a,b) = lim ff» fax, dxs---dxy- 


Qmie\ 1/2)" mn 
2 
x ( fe ) exp ae Le — Xn_-1) (3215 


(where we have suppressed the vector index i). Using the previous relation 
(1.3.20), the final result is equal to 


i Lim(t, = Xa) 
exp (mea) ; (1.3722) 


th — ty 


m 


Saale Seema 


The transition probability function K has some very interesting properties. For 

example, it solves the wave equation: 
nace K(a, b) 9 KY b) (1.3.23) 
—— K(a, b) =i—K@(a, 3 
2m dx? ta 

when 1, is greater than fp. 

Later, we will generalize these expressions for the case of freely propagating 
strings, and we will find that these expressions for the Green’s functions carry 
over with only small, but important, changes. 

To show the relationship between the Hamiltonian and Lagrangian for- 
malisms in the path integral approach, it is helpful to insert a complete set 
of intermediate states when we divide up the path from a to b. Let us treat the 
variable x as an operator Xx acting on a set of eigenstates: 


ela) =x |) (13324) 
The |x) represents an eigenstate of the position operator, treating + as an 


operator whose eigenvalue is equal to the number x. Then completeness over 
eigenstates for coordinates and for momenta can be represented as 


= f tear Ge, 


1= | \p)dp (pl. (1.3.25) 
We normalize our states as follows: 
(x|y) = 5(x =), 
elPx 
(ples aay (1.3.26) 


(Because of the infinite number of normalization constants that constantly 
appear in the path integral formalism, we will often delete them for the sake of 
clarity in this book. We do not lose any generality, because we can, of course, 
reinsert them into the path integral if we desire.) 
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With these eigenstates, we can now rewrite the expression for the Green’s 
function for going from point x; to xy. 
RGN) = (x1, tilxy, ty). (13238 


In order to derive the previous expression (1.3.22) for transition amplitude, 
let us insert a complete set of intermediate states at every intermediate point 
between x, and xy: 


Oily, in) = bitin) f de bxa,to| f de 


selects) f dyn (ene ty-abtws ty): (1.3.28) 


Now let us examine each infinitesimal propagator in terms of the Hamiltonian, 
which we write as a function of the coordinates and derivatives: 
=o.) (133233 


Then the transition for an infinitesimal interval is given by 


(x | eT A(X. 8, )8t |x>) 


—iH(x,0;)8t ( 


(X1, ty |x2, ty) 


—e X1|X2) 


= em 808 Oxiip) f dp ( p|x2) 


=e 
20 


ee ad ‘| OP iis (1.3.30) 
20 
It is very important to notice that path integrals have made it possible to 
make the transition from classical to quantum commutators. The Hamiltonian 
can be expressed either as a function of derivatives with respect to the position 
or as a function of the canonical momenta because of the identity: 


d,e'?* = ipe!?*. (3.30) 
This allows us to make the important identification: 


H(x, p) > H(x, dx), 
(13532) 


P< -I1—. 
A One 
In the functional formalism, the important correspondence between momenta 
and partial derivatives arises because of this identity. 
Putting everything together, we can now write the complete transition 
amplitude as 


Vy Tn 
(seni) = | Dp Dx exp {i [ [px — H(p. oar » Gelese33) 
\y fh 
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where 
HA = —4 V(x). 
om + V(x) (1.3.34) 


(As usual, we have dropped all the intermediate normalizations, which are 
just factors of 27.) Notice that the functional integral, which was once only 
a function of the coordinates, is now a function of both the momenta and the 
coordinates. 

In order to retrieve the original Lagrangian, we can perform the p integration 
exactly, because it is a simple Gaussian integral, and we arrive at 


XN ty 

Goya iL Dx exp f / [4mx; — V(x)] ar| . (1.3.35) 
xX} t 

We have thus made the transition between the Lagrangian and the Hamiltonian 

formalism using functional methods. We can use either: 


L=imi?-Vax) @ H=—4+V(x). (1.3.36) 


Functionally, the only difference between these two expressions is whether 
we integrate over the coordinates or a combination of the coordinates and the 
momenta. The transition probability can be represented as 


K{a,b) = i: Dx exp f [ dt [4mx; — vo] 


Xp th iD 
=| pxbpexp|i | Fr |e vm] Can 
Xa la L 2m 


1.4 Relativistic Point Particles 


So far, our discussion has been limited to nonrelativistic particles, where all 
degrees of freedom are physical. However, nontrivial complications occur 
when we generalize our previous discussion to the case of relativistic par- 
ticles. In particular, the (—1) appearing in the Lorentz metric will, in general, 
cause nonphysical states to propagate in the theory. These nonphysical “ghost” 
states, which have negative probability, must be eliminated carefully to ensure 
a sensible causal theory free of negative norm states. 

For the relativistic case, let us assume that the location of a point particle is 
given by a four-vector: 


Xy(T), earl) 


where parametrization t does not necessarily refer to the time. The action is 
particulary simple, being proportional to the four-dimensional path length: 


S=-—-—m | ds = —m (length). (1.4.2) 
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The path length ds can be written in terms of the coordinates: 
SSS ale (1.4.3) 


where the dot refers to differentiation with respect to the parameter T. 
This action, unlike the previous nonrelativistic action, is invariant under 
reparametrizations of the fictitious parameter t. Let us make a change of 
coordinates from T to T: 


T > T(T). (1.4.4) 
Then we find 
“Ee = Spi 
dt 
es (1.4.5) 
dt dt dt 
1/2 a ie) 


2 2 
at) dt= (=) at. 
dt dt 


Thus, the action is invariant under an arbitrary reparametrization of the variable 
te 
This can be written infinitesimally as 


to T tons (1.46) 
OXg == rome — 
As before, we can now introduce canonical conjugates: 
6L mx 
P= eri =" (1.4.7) 
Q /-2 


The crucial difference, however, from our previous discussion of the non- 
relativistic point particle is that not all the canonical momenta are independent. 
In fact, we find a constraint among them: 


Pe +m’? =0. (1.4.8) 


Thus, the mass shell condition arises as an exact constraint among the momenta. 
If we calculate the Hamiltonian associated with this system, we find that 


H = p"x, —L=0. (1.4.9) 


The Hamiltonian vanishes identically. 

These unusual features, the vanishing of the Hamiltonian and the constraints 
among the momenta, are typical of systems with redundant gauge degrees of 
freedom. The invariance under reparametrization, for example, tells us that the 
path integral that we wrote earlier actually diverges: 


/ Dxe'’ = 00. (1.4.10) 
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This is because there is a separate contribution from each particular param- 
etrization. But since Dx is parametrization invariant, this means that we are 
summing over an infinite number of copies of the same thing. Thus, the integral 
must diverge. 

Dirac, however, explained how to quantize systems with redundant gauge 
degrees of freedom. For example, let us introduce canonical momenta p and 
impose the constraint condition via a Lagrange multiplier as follows: 


L= pix" — he( pi, +m’). (1.4.11) 


The constraint equation (1.4.8) is imposed here as a classical equation of 
motion. By varying e, we recover the constraint on the momenta. Quantum 
mechanically, however, this constraint is imposed by functionally integrating 
out over e. In the path integral, we have 


/ De exp E face + m’) ~ 8(p* +m’), (1.4.12) 
where we have used the fact that the integral over e’** (or the Fourier transform 


of the number 1) is equal to (x). Notice that the new Lagrangian (1.4.11) still 
possesses the gauge degree of freedom. It is invariant under 


0 = ae 
) ay On 

pee re). (1.4.13) 
dt 


The advantage that this action has over the previous one is that all variables 
occur linearly. We do not have to worry about complications caused by the 
square root. (The field e that we have introduced will become the metric tensor 
24» When we generalize this action to the string.) 

Let us now functionally integrate over the p variable. Because the integration 
is again a Gaussian, we have no problem in performing the p integration: 


| »» exp f if dt [px — te(p> + nm) 
~ exp {i f ari(e-ti2 —em?y} , (1.4.14) 
Thus, we have now obtained a third version of the point particle action. The 


advantage of this action is that it is linear in the coordinates and is invariant 
under 


Otel 
d(ee) (1.4.15) 
sZ= : 
ae 


In summary, we have found three equivalent ways to express the relativistic 
point particle. The “second-order” Lagrangian (1.4.14) is expressed in terms of 
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second-order derivatives in the variable x,,(t) and the field e. The “nonlinear” 
Lagrangian (1.4.3) is expressed only in terms of x,,(T). It can be derived from 
the second-order form by functionally integrating over the e field. And finally, 
the “Hamiltonian” form contains both x,,(t) and the canonical conjugate p(t) 
(it is first-order in derivatives): 


first-order (Hamiltonian) form: L = p,x" — Se( Di, mney 


second-order form: L = Se x) = em). (1.4.16) 


nonlinear form: L = | ee 


All three are invariant under reparametrization. Each of them has its own 
distinct advantages and disadvantages. This exercise in writing the action of 
the free relativistic particle in three different ways is an important one because 
it will carry over directly into the string formalism. Expressed in terms of path 
integrals, the point particle theory and the string theory are remarkably similar. 


1.5 First and Second Quantization 


In this section, we will quantize the classical point particle and then show 
the relationship to the more conventional second quantized formalism of field 
theory. The first quantization program, as we shall see, is rather clumsy com- 
pared to the second quantized formalism that most physicists are familiar with, 
but historically the string theory evolved as a first quantized theory. The great 
advantage of the second quantized formalism is that the entire theory can be 
derived from a single action, whereas the first quantized theory requires many 
additional assumptions. 

The transition from the classical to the quantum system is intimately linked 
with the question of eliminating redundant infinities. As we said before, the 
path integral is formally ill-defined because we are summing over an infinite 
number of copics of the same thing. The trick is to single out just one copy. 

There are at least three basic ways in which the first quantized point particle 
may be quantized: the Coulomb gauge, the Gupta -Bleuler formalism, and the 
BRST formalism. 


Coulomb Quantization 


Here, we choose the gauge 
=i =r. Cie) 


In other words, we set the time component of the x variable equal to the real 
time f, which now parametrizes the evolution of the string. In this gauge, the 
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action reduces to 
| i -m | 1—v? dt. (125.2) 


In the limit of velocities small compared to the velocity of light, we have 
L ~ 3mx? (153) 


as before, so that the functional integral is modified to 


[ 2x80 - ne! = f Dx exp (if mazar), (1.5.4) 


For the case of the string, this simple example will lay the basis for the light 
cone quantization. The advantage of the Coulomb gauge is that all ghosts have 
been explicitly removed from the theory, so we are dealing only with physical 
quantities. The other advantage is that the zeroth component of the position 
vector is now explicitly defined to be the time variable. The parametrization 
of the point particle is now given in terms of the physical time. 

The disadvantage of the Coulomb formalism, however, is that manifest 
Lorentz symmetry is broken and we have to check explicitly that the quan- 
tized Lorentz generators close correctly. Although this is trivial for the point 
particle, surprising features will emerge for the quantum string, fixing the 
dimension of space-time to be 26. 


Gupta—Bleuler Quantization 


This approach tries to maintain Lorentz invariance. This means, of course, that 
particular care must be taken to prevent the negative norm states from spoiling 
the physical properties of the S-matrix. The Gupta—Bleuler method keeps the 
action totally relativistic, but imposes the constraint (1.4.8) on state vectors: 


[p, +m’]|¢) =0. (1.5.5) 


(Notice that the above equation is a ghost-killing constraint, because we can 
use it to eliminate po.) This formalism allows us to keep the commutators fully 
relativistic: 


[Pus xv] = —inu, (1.5.6) 
where we choose n,,, = (— + ++ .---). Notice that this gauge constraint 
naturally generalizes to the Klein—Gordon equation: 

[O — m?)¢(x) = 0. (1.5.7) 


The Gupta—Bleuler formalism is an important one because most of the 
calculations in string theory have been carried out in this formalism. 
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BRST Quantization 


The advantage of the BRST formalism [45, 46] is that it is manifestly Lorentz- 
invariant. But instead of regaining unitarity by applying the gauge constraints 
on the Hilbert space, which may be quite difficult in practice, the BRST formu- 
lation uses the Faddeev—Popov ghosts to cancel the negative metric particles. 
Thus, although the Green’s functions are not unitary because of the propagation 
of negative metric states and ghosts, the final S-matrix is unitary because all 
the unwanted particles cancel among each other. Thus, the BRST formalism 
manages to incorporate the best features of both formalisms, 1.e., the mani- 
fest Lorentz invariance of the Gupta—Bleuler formalism and the unitarity of 
the Coulomb or light cone formalism. In order to study the BRST formalism, 
however, we must first understand Faddeev—Popov quantization. 


1.6 Faddeev—Popov Quantization 


Before we discuss the BRST method, it is essential to make a digression and 
review the formalism developed by Faddeev and Popov [47]. As we said earlier, 
the path integral measure Dx,, is ill-defined because it possesses a gauge degree 
of freedom, so we are integrating over an infinite number of copies of the same 
thing. Naively, we might insert the gauge constraint directly into the path 
integral. If the constraint is given by some function F of the fields being set to 
zero: 


F(x,) = 0, (1.6.1) 


then we insert this delta functional directly into the path integral: 
Pos | Dx | [otF@ule’®. (1.6.2) 


However, this naive approach is actually incorrect because the delta functional 
contributes a nontrivial measure to the functional integral. 

The key to the Faddeev Popov method is to insert the number | into the 
functional, which obviously has the correct measure. For our purposes, the 
most convenient formulation of the number 1 is given by 


ll = ave f Ded{ F(x) )]. (1.6.3) 


where € 1s the parametrization of the gauge symmetry of the coordinate, in 
(1.4.6), x7 is the variation of the field with respect to this symmetry, and the 
Faddeev Popov determinant Arp is defined by the previous equation. 

Notice that the integral appearing in the previous equation is an integration 
over all possible parametrizations of the ficld. Since we are integrating out 
over all possible parametrizations, then, by construction, the Faddeev—Popov 
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determinant is gauge independent of any particular parametrization: 
Arp(x) = Arp(x*). (1.6.4) 


Let us now insert the number | into the functional integral and make a gauge 
transformation to reabsorb the ¢ dependence in x: 


z= | Dxaro(x) f Desiree 
= / Dx Agp(x) | Ded[ F(x)}e’*. (1.6.5) 


Notice here the x* was gauge rotated back into the original variable x. Since 
all other parts of the functional integral were already gauge independent, we 
now have 


Z= il Ds| / Dx Agpd[F(x)]e’®. (1.6.6) 


We can now extract out the integral over the gauge parameter, which measures 
the infinite volume of the group space: 


volume = / De (26:7) 


and obtain a new expression for the functional which no longer has this infinite 
redundancy: 


Tie / Dx Appd[ F(x)]e!*. ) (1.6.8) 


Notice that a naive quantization of the path integral would simply insert the 
F constraint and would omit the Faddeev—Popov determinant, which is a new 
feature that makes the measure come out correctly. 

Now let us calculate the Faddeev—Popov determinant, which carries all the 
information concerning the ghosts of the theory. The trick is to change variables 
from ¢ to F. We can do this because both ¢ and F have the same number of 
degrees of freedom. Thus, the Jacobian can be calculated: 


OF 
det =| De=]DF., (1.6.9) 
bé 


We can therefore write 


i {foes} - { f orae{ Jom} 


= 
OF 
= fact | | | = act | | | (1.6.10) 
OF reo b€ | ro 


Thus, the Faddeev—Popov factor can be expressed as a simple determinant of 
the variation of the gauge constraint. It is more convenient to introduce this 
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factor directly into the action by exponentiating it. We use the following trick: 


App = / D6 Dée'*, (1.6.11) 
where the new ghost contribution to the action is given by 
Blige ok 
Ssh = [ara =| 6, (1.6.12) 
b€ | ro 


where the 6 variables are anticommuting c-numbers called Grassmann num- 
bers. Normally, when performing functional integrations, we expect to find the 
determinant of the inverse of a matrix. With functional integration over Grass- 
mann numbers, the determinant occurs in the numerator, not the denominator. 
Grassmann numbers have the strange property that 


6,0; = —9;9;. (1.6.13) 
In particular, this means 
6? =0. (1.6.14) 


Normally, this would mean that 6 vanishes. However, this is not the case for a 
Grassmann number. Thus, we also have the strange identity 


e® = 1460. (1.6.15) 


This identity makes the integration over exponentials of Grassmann-valued 
fields in the functional integral rather easy, because they are simply polyno- 
mials. More identities on Grassmann numbers are presented in the Appendix. 
where we show that 


N N 
[Va dé; aps fat = det(,,). (1.6.16) 
i=] 


ij=l 


This identity verifies that integration over Grassmann variables yields deter- 
minant factors in the numerator, not the denominator, so that we can express 
the Faddeev—Popov determinant in (1.6.11) as a Grassmann integral. 

Now that we have developed the apparatus of Faddeev -Popov quantiza- 
tion, Ict us return to the BRST approach, where we wish to impose the gauge 
condition 


e=1 (1.6.17) 


(we omit some subtleties with respect to this gauge). In this gauge. we should 
be able to recover the usual covariant Feynmann propagator. To show this. 
notice that our action (1.4.14) becomes 


L = 4(x2 —m’). (1.6.18) 
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Given this Lagrangian, our Green’s function for the propagation of a point 
particle from one point to another is now given by 


Artois 82) = Gail as baa) = (nlf dee" Ln) 


[o.) X2 1 T 
=| a | Dx exp -;[ dux any). (1-619) 
0 Xx} : Z 0 ig 


Notice that this is the usual covariant Feynman propagator rewritten in first 
quantized path integral language. 
Originally, before gauge fixing, our action was invariant under 
d(ée) 
yg : 6.20 

ae oa ) 
Thus, the Faddeev—Popov determinant associated with the gauge choice e = | 
is the determinant of the derivative. We now use a Gaussian integral over 
Grassmann states to represent the determinant, using (1.6.10): 


Arp = det |d,| = D6 D6 exp (: f ardo.a). (1.6.21) 


(If we had used ordinary real fields instead of Grassmann-valued fields, the 
determinant would have come out with the wrong power.) 

Putting everything together, we find that our final action can be represented 
as 


L = p,X" — (pi, +m’) —i60,0. — (1.6.22) 
The essence of the BRST approach is to notice that this gauge-fixed action has 
the additional symmetry: 
6X, = 180%, 
6Py = 16 Dy, 
50 = ic00, (1.6.23) 
80 = ic00 + Se(p,, +m’). 


At first, we may wonder why yet another symmetry appears after we have 
already fixed the gauge degree of freedom. However, this extra symmetry is 
global and hence does not allow us to impose any constraints on the theory. 
This symmetry, therefore, is different from the ones found earlier and cannot 
be used to eliminate gauge fields from the action. 

We can summarize the BRST approach by extracting an operator Q that will 
generate the symmetry found earlier: 


6g = [€Q, 9], 
Q =6(0-»m”), (1.6.24) 
Q? =0. 
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The physical states satisfy 
Op) — 0. (1.6.25) 


Notice that enforcing this constraint recovers the Klein—Gordon equation for 
on-shell particles: 


(CO — m’)¢ = 0. (1.6.26) 


1.7 Second Quantization 


So far, we have been analyzing only the first quantized approach to quantum 
particles. We have quantized only the position and momentum vectors: 


first quantization: [p;,x;] = —14;;. C7 


The limitations of the first quantized approach, however, will soon become 
apparent when we introduce interactions. Let us say that we wish to describe 
point particles that can bump into each other and split apart. rather than intro- 
duce an external potential. We must now modify the generating functional to 
include summing over Feynman graphs: 


T= Ne | Dye eh (iz) 


topologies 


(Notice that we have Wick rotated the t integration so that the exponential 
converges. It will be clear from the context when the Wick-rotated theory is 
being used in this book because the exponential becomes real. We will not 
discuss the delicate question of the convergence of path integrals.) 

In other words, we must, by hand, sum over the various particle topologies 
where point particles can split and reform. Each topology represents the history 
of the trajectories of the various point particles as they interact. The amplitude 
for N-particle scattering, with momenta given by A), A>, .... Ay. can now be 
represented as 


A(ki, ko, -.-, kv) = 2 2" | Dxdm 


topologies 


N 

x exp { — | Cs LR eet ai Oc 
Notice that we are taking the Fourier transform of the Green’s function, so that 
the amplitude 1s a function of the external momenta. This formula can be more 
conveniently represented as 


N 


Av= > g"(exp pax |). (1.7.4) 


topologies | 
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Thus, we associate a factor e** for each external particle coming from the 
Fourier transform term. This path integral formula for the scattering amplitude 
is important because it will carry over almost exactly into the string formalism. 

Notice how clumsy this description is. We must fix the set of all topologically 
allowed configurations and their weights by hand. Furthermore, unitarity of 
the S-matrix is not at all obvious. 

In a second quantized description, however, we introduce a field w(x) 
and quantization relations between the fields themselves, not between the 
coordinates: 


second quantization: [z(x), W(y)]m=y = —1-6(; — yj). (1.7.5) 


The advantage of the second quantized approach is that the interacting Hamil- 
tonian can be written explicitly, without having to introduce sums over 
topologies. Showing that the Hamiltonian is Hermitian is sufficient to fix the 
weights of all diagrams and to demonstrate the unitarity of the S-matrix. 

In summary, the pros and cons of first and second quantizations are as 
follows: 


First Quantization 

(1) Interactions must be added in by hand, order by order in the coupling 
constant. 

(2) Unitarity of the final S-matrix is not obvious. This must be explicitly 
checked order by order. 

(3) The formalism is necessarily a perturbative one, since the expansion in 
topologies is intimately tied to the expansion in terms of the coupling 
constant. 

(4) It is difficult to describe the theory off-shell. 


Second Quantization 

(1) The interactions are explicit in the action itself. 

(2) Unitarity is guaranteed if the Hamiltonian is Hermitian. 

(3) The theory can be formally written nonperturbatively as well as perturba- 
tively. 

(4) The theory is necessarily off-shell. 

The transition from the first to the second quantized theory can also be per- 
formed most easily in the path integral formalism in the Coulomb gauge. 
Earlier, we showed that the Green’s function for a propagating free point 
particle can be explicitly evaluated: 


1/2 _x/y 
m _m(Xp — Xa) 
. a pucieee | 1.7.6) 
pee) tea = al a | Ath = ta) 


This Green’s function can also be written in a second quantized fashion. Let 
’ us start with the Hamiltonian: 
] 


H=->-V (1.7.7) 
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The Green’s function satisfies 
(id, — H)K (x, ty x’, t') = 6 (x — x')6(t — £’). (1.7.8) 
Solving for this Green’s function, we find 
K (a, b) = {id, — Ale ee (1.7.9) 


where we are treating the inverse Green’s function as if it were a discrete matrix 
in (x, t) space, and we have dropped trivial normalization factors. This allows 
us to write the integral in second quantized language. To demonstrate this, we 
will use the following identities throughout this book: 


N N N 
/ | [ax exp a =e Pi cs 
nk ga = 


oi 


wie 


(This integral can easily be derived using our earlier formula for the Gaussian 
integral (1.3.18). We simply diagonalize the A matrix by making a change of 
variables in x. Thus, the quadratic term in the integral becomes a function of the 
eigenvalues of the A matrix. Because all the modes have now decoupled, the 
Gaussian integral can be performed exactly by completing the square. Finally, 
we make another similarity transformation to convert the eigenvalues of A 
back into the A matrix itself.) 
From this, we can also derive the following: 


N N N 
ES. I] dx; exp I =p Se Apae | 
i=l i=l i=l 


5.8 l 

ae <J(A')if J; Aj;|| 
~le5 exp {FI Jip] det ai 

~ (Aq "nm(det |Ay|)". (7.11) 


These are some of the most important integrals in this book. Using these 
equations, we can now write the Green’s function totally in terms of second 
quantized fields: 


K(a.b) = fo... ean DY Du exp fi f asartovn), (17a) 
where 


Li) = w"(id, — H)y, (1.7.13) 


where we are again treating K (a, b) as if it were a matrix in discretized (x, f) 
space. 
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In summary, we now have two complementary descriptions of the point 
particle. We can write the theory either in terms of the particle’s coordinates 
x, or in terms of its fields y(x). 

At the free level, both descriptions are totally equivalent, both in ease of 
description and also in mathematics. However, at the interacting level, distinct 
differences appear. For example, it is easy to write 


L,~¢, ~¢', (1.7.14) 


and we are guaranteed to get a unitary description of an interacting field. 
However, in the first quantized approach, the sum over topologies: 


S (1.7.15) 


topologies 

is a clumsy way in which to describe a unitary theory. We must check uni- 
tarity order by order in increasingly complicated diagrams. Furthermore, we 
are forced to adopt a totally perturbative description for the first quantized 
description. The sum over topologies in the first quantized path integral is a 
sum over perturbative Feynman diagrams, so the formulation is necessarily 
perturbative from the very beginning. That is the fundamental reason why we 
have divided this book into first quantization and second quantization. 


1.8 Harmonic Oscillators 


One example that will illustrate the relationship between first and second quan- 
tizations is the harmonic oscillator problem. This example will prove helpful 
in introducing the harmonic oscillator representation, which we will use ex- 
tensively for the string model. Let us begin with a point particle governed by 
the following Hamiltonian: 
2 p 1 

ol a tag kt (1.8.1) 
where k is the spring constant. Because the momenta and coordinates are 
conjugates, we can use the same arguments presented earlier in our discussion 
of path integrals to set 


[p,x] = —i. (1.8.2) 
We can now redefine our coordinates and momenta in terms of harmonic 
oscillators: 
P= (4ma)'/"(a +a'), 
x =i(2mw) (a — a’), (1.8.3) 
ier 
k= mo’. (1.8.4) 
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In order to satisfy the canonical commutation relation (1.8.2), we must have 


[a,a'] =1. (1.8.5) 
If we insert this expression back into the Hamiltonian, we find 
fi Sw(aa' eguia): (1.8.6) 


By extracting a c-number term, we can write this in normal ordered fashion: 
H = w(a'a+ho), (1.8.7) 


where hp is the zero point energy. We can now introduce the Hilbert space of 
harmonic oscillators. Let us define the vacuum as 


a\0) = 0. (1.8.8) 


Then an element of the Fock space of the harmonic oscillator Hamiltonian is 
given by 


(a‘)" 
2) = |0) (1.8.9) 
. Vn! 
such that the states form an orthonormal basis: 


The energy of the system is quantized and given by 
E, = (n+ 4)o. (sett) 


So far, the systems has been presented only in a first quantized formalism. 
We are quantizing only a single point particle at any time. We would now like 
to make the transition to the second quantized wave function by introducing 


|>) = } > on In), (1.8.12) 
n=0 


where we power expand in the basis states of the harmonic oscillator. Thus. 
instead of describing a single excited state of a point particle, we are now 
introducing the wave function, which will be a superposition of an arbitrary 
number of excited states. 

Let us make the important definition 


(x|P) = D(x). CUS.13} 


This can be calculated explicitly. Notice that we now have two independent 

basis states, the harmonic oscillator basis |) and the position eigenvectors |x). 

We must now calculate how to go back and forth between these two bases. 
Let us first analyze the simplest matrix element: 


Oo(x) = (x|0). (1.8.14) 
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This matrix element satisfies the equation 


0= Xx] a|0) 
; — ima 
= (x| Bex |0) 


2mw 


= (2mw)7'/? (-= _ imo (x |0) 


= —i(2mw)~'/ (~ + mor) o(x). (18515) 
This last equation can be solved exactly: 
o9(x) = (mw/m)'/4e7 (1.8.16) 
where 
€ = (mo)!'/*x. (1.8.17) 


It is now a straightforward step to calculate all such matrix elements. Let 


o,(x) = (x|n) 
cl (yaa 0) 
= (n!)'?(2ma)""/ (x| [p + imwx]" |0) 


FA) n 
= (n!)"'?(2mw)-"/2" (-i2 + ima) G(x). (1.8.18) 
The solution is therefore 
6(x) = i"(2"n!)-/?(mw/x)'/4 (« = =) ele” (1.8.19) 


In general, these are nothing but Hermite polynomials H,,. In terms of these 
polynomials, we can express the eigenstate |x) and |n) in terms of each other: 


Ix) = Do In) (nix) = Do In) ona), 
n=l n=) 


in) = tx) f ax txln) = f deon(x) lx), 


(1.8.20) 


Thus, using (1.8.12) and (1.8.20), we have the power expansion of the wave 
function in terms of a complete set of orthogonal polynomials, the Hermite 
polynomials: 


(x) = (x10) = Ol bain) = od HlBOM™. (1.8.21) 
n=l 


n=] 


Similarly, it is not difficult to calculate the Green’s function for the propaga- 
tion of a point particle ina harmonic oscillator potential. The Green's function 
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would be the same as if we had started with the second quantized formalism 
with the action: 


1 
L = ®(x)* (ia + ae ~ thx") (x). (1.8.22) 
m 


From this second quantized action, we can therefore derive the equations of 
motion: 


id, (x, t) = E aly + $kx *) 0 t) 
= H®P(x, t). (1.8.23) 


From this, we can define the canonical momenta conjugate to P(x, t) such that 
the canonical quantization relations are satisfied: 


[TI(x, t), B(x’, t)] = —id(x — x’). (1.8.24) 


1.9 Currents and Second Quantization 


Let us begin with a discussion of the relativistic second quantized theory, which, 
as we have seen, is equivalent perturbatively to the first quantized theory. When 
quantizing the point particle in the Gupta—Bleuler formalism, we were led to 
the equations of motion: 


[0 — m’]¢ =0 (1.9.1) 
which can be derived from the second quantized action: 
L = 4[0,00"¢ + m’¢’]. (1.9.2) 


One of the most powerful techniques we used to explore the first quantized 
theory was symmetry. We would now like to study the question of symmetries 
within the second quantized formalism. 

First, let us calculate the equations of motion by making a small variation 
in the field and requiring that the action be invariant under this variation: 


bL 
== 5 53 qo, 
[ars (S064 2% 39,0), (1.9.3) 
Let us now integrate by parts, using 50,,@ = 0,,4¢: 


bL éL 
5S = | d?x — a, ) | Das (= é ). 
i al Steg | Oe 


If we temporarily ignore the surface term, the action is stationary if we have 
the following equation of motion: 


éL éL 


borat (1.9.5) 
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If we insert the Lagrangian into this equation, we obtain the equations of 
motion, which reproduce the constraint found earlier in the first quantized 
formalism. 

Let us now make a small change in the fields, parametrized by a small, as 
yet unspecified, number e°%: 


bg 
dd = — Se". 9. 
p aoa € (1.9.6) 


If we insert this into the previous equation for the variation of the action, keep 
the surface term intact, and assume that the equations of motion are satisfied, 
then we have the following: 


OL 5b 
8S = | d?xd — ]} de”. LO, 
/ Vas =) aU 
Let us define the tensor in the parentheses as the current: 
6L bo 
Jt = ———., 1.9.8 
* $0, de% ( ) 


Then we have the important equation 
6s = [arxa,stser. (1.9.9) 
Thus, if the action S$ is stationary under this variation, we have a conserved 
current J**: 
o> = 0 (1.9.10) 
We will use this equation over and over again in the discussion of strings when 
we want to extract the current for supersymmetry and conformal invariance. 


Finally, we note that the integrated charge Q% associated with the current is 
constant in time: 


J feta = [ aexays + surface term. (1.9.11) 
Thus, 
a [artes 
Le ee << ay (1.9.12) 


Finally, we wish to construct yet another conserved current associated with 
the action. Let us make a small variation in the space-time variable: 


Ox =e. Glee mke)) 
Under this charge, the volume element of the integral changes as 


bd a = do xOOx. (1.9.14) 
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Therefore, the variation of the action under this change is 


6S = sige Sell, 


6L 
b6L = 6x"0,L a —— 50,¢. 1.9.15 
x" oy ee 5b ot+ 30,6 uP ( ) 
Now, if we assume that the equations of motion are satisfied, we have 
éL 
= | dx, Lé? — ——9 1.9.16 
a i ri {(+ 566 wo ox | es 
If we now define the a ae tensor as 
Ty = age - yn (9) 


then we have the equation 
6S = f arxa,crax.). (1.9.18) 
So if the action is invariant under this change, then the energy-momentum 
tensor is conserved: 
oh sel be (1.9.19) 


For example, for the scalar particle action, the energy-momentum tensor 
becomes 


Tyv = 8,09,0 — Quy (1.9.20) 


which is conserved if the equation of motion holds. 

Lastly, it is instructive to investigate how the various quantization proce- 
dures treat the Yang—Mills field (see Appendix). Let us begin with the SUCY) 
invariant action: 


L=-;1F.), (1.9.21) 
where 
Fi, = 0,4, — OA, — f° AGA. (92?) 
The action is invariant under 
SA, = 0,A7 = fan, (6723) 


where A“ is a gauge parameter. 
The path integral method begins with functional 


= | | [@A.@e' PAU (1.9.24) 


UX 


Now we consider the three methods of quantization. 
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Coulomb Quantization | 
The gauge invariance permits us to take the gauge 
Vi At = 0. 


We can integrate over the Ag component because it has no time derivatives, so 
the Coulomb formulation is explicity ghost-free. (The price we pay for this, of 
course, is the loss of manifest Lorentz invariance, which must be checked by 
hand.) In this gauge, the action becomes 

. L= +(OAf) — 4(Fyy +++ (1.9.25) 


where all fields are transverse. This is the canonical form for the Lagrangian. 


Gupta—Bleuler Quantization 


The advantage of the Gupta—Bleuler formulation is that we can keep manifest 
Lorentz symmetry without violating unitarity. For example, let us take the 


gauge 


anal. = 0, (1.9.26) 
In this gauge, the propagator for massless vector particles becomes 
HE (1.9.27) 
P 
Notice that the propagator explicitly contains a ghost. The timelike excitation 
has a coefficient of —1 in the propagator, which represents a ghost. However, 


we are free to quantize in this covariant approach because we will impose the 
ghost-killing constraint on the Hilbert space: 


(p| 0,A”* |W) = 0. (1.9.28) 


This constraint allows us to solve for and hence eliminate the ghost modes. 
Thus, although the free propagator will allow ghosts to propagate, the Hilbert 
space is ghost-free, so the theory itself is both Lorentz invariant and ghost-free. 


BRST Quantization 


The BRST approach begins by calculating the Faddeev-Popov determinant 
(1.6.10). Let us calculate the determinant of the matrix: 


-" d(@jpAl{x)) 
M(x, y) = aay UC) 
a ‘ Ou Na (oa) 
ee SOD 
0) DS y= y)6*”). Cl) 
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As before, we can write the determinant of M“” by exponentiating it into the 
action using (1.6.10): 


ee = (3,AK*)? + MC? (1.9.30) 
OL 


where the anticommuting Faddeev—Popov ghost fields are represented by c 
and c. This action is invariant under the following BRST transformation: 


$A% = (V,c)%e, 
1 pabd bd 
BRST. § 6° =—3 feces, (1.9.31) 
_ 
60° = —(0,.A"*)e. 
ry 


Once again, it is important to notice that the BRST transformation is nilpotent. 
The BRST symmetry is not connected to the conservation of any observable 
quantity. From the previous invariance, we can extract out the generator of this 
transformation S§ such that 


QO” =0. (1.9.32) 
The physical states of the theory then satisfy 
Q |phy) = 0. (1.9.33) 


1.10 Summary 


The great irony of string theory, which is supposed to provide a unifying 
framework for all known interactions, is that the theory itself is so disorganized. 
String theory is often frustrating to the beginner because it is full of folklore. 
conventions, and arbitrary rules of thumb. The fundamental reason for this is 
that string theory has historically evolved backward as a first quantized theory. 
rather than as a second quantized theory, where the entire theory is defined 
in terms of a fundamental action. The disadvantages of the first quantized 
approach are that: 


(1) The interactions of the theory must be introduced by hand. They cannot 
be derived from a single action. 

(2) Unitary is not obvious in this approach. The counting of graphs must be 
checked tediously. 

(3) The formulation is perturbative, so that crucial nonperturbative calcula- 
tions, such as dimension breaking, are beyond its scope. 

(4) The formulation is basically on-shell, rather than off-shell. 


By contrast, the advantage of the second quantized approach is precisely 
that everything can be derived from a single off-shell action, where unitarity 
is manifest and nonperturbative calculations can, in principle, be performed. 
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Unfortunately, string theory evolved historically as a first quantized theory. 
Thus, string theory has been evolving backward, with the second quantized ge- 
ometric theory still in its infancy. For pedagogical reasons, we have introduced 
string theory from a semihistorical point of view, beginning with the first quan- 
tized theory and later developing the second quantized theory and M-theory. 
We hope that future accounts of string theory will reverse this sequence. 

To reduce the level of arbitrariness in the first quantized theory as much as 
possible, in this chapter we have tried to lay the groundwork for string theory 
in the formalism of path integrals. This functional formalism has the great ad- 
vantage that we can express the first and second quantized gauge theories with 
equal ease. We find, in fact, that large portions of the path integral formulation 
of point particles can be incorporated wholesale into string theory. 

The path integral method postulates two fundamental principles that express 
the essence of quantum mechanics: 


(1) The probability P(a, b) ofa particle going from point a to point b is given 
by the absolute value squared of a transition function K (a, b), 
P(a, b) = |K(a, b)|’. 


(2) The transition function is given by the sum of a phase factor e'*, where S$ 
is the action, taken over all possible paths from a to b, 


AC) a 


paths 


In the limit of continuous paths, we have 


b 
K(a, b) = / Dire 


where 


The action S of the first quantized point particle is given by the length 
of the path that the particle sweeps out in space-time. We can represent the 
Lagrangian for the point particle in three ways: 


first-order (Hamiltonian) form: L= p,x* — Fel Di, +m’), 


second-order form: L= 4(e~'x?, — em’), (1.10.1) 


nonlinear form: L=—m,/—x?. 
Unfortunately, because all three forms of the action are parametrization- 
invariant, the path integral diverges. Thus, the quantization procedure must 
, break this gauge symmetry and yield the correct measures in the functional. 
‘These actions can be quantized in three ways, each with its own advantages 
and disadvantages: 
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(1) Coulomb Quantization 


By explicitly fixing the value of some of the fields, such as 
30) =] Se 


we can eliminate the troublesome negative metric states and the Lagrangian 
becomes 4mv;. The Coulomb quantization method is therefore manifestly 
ghost-free. However, the disadvantage of this method is that it is very awk- 
ward because manifest Lorentz symmetry is broken and must be checked 
at every level. 


(2) Gupta—Bleuler Quantization 


The advantage of the Gupta—Bleuler quantization method is that we have 
a manifestly covariant quantization program. Of course, negative metric 
ghosts are now allowed to circulate in the theory, but they are eventually 
eliminated by imposing the gauge constraints directly onto the Hilbert 
space: 


[p, +m’]|¢) = 0. 


Thus, the S-matrix is ultimately ghost-free. The disadvantage of this 
approach, however, is that the imposition of these gauge constraints, 
especially at the interacting level, is frequently quite difficult. 


(3) BRST Quantization 


This method of quantization keeps the good features of both approaches. 
The theory is manifestly covariant, but the S-matrix is still unitary because 
the addition of ghost fields in the theory cancels precisely against the 
negative metric states. The BRST method imposes the gauge e = | in 
the first-order form and then inserts the Faddeev -Popov term A¢p into the 
functional to get the correct measure. We can exponentiate this determinant 
into the action by using Grassmann variables: 


App — det |0z|/= [« doei farGa0 


The resulting gauge-fixed action has a residual symmetry, called the BRST 
symmetry, which is generated by Q, the BRST charge. 


When we generalize these methods to the interacting case. the path integral 


formulation begins with the fundamental formula for the transition function 
for N -particle scattering: 


Alki; ke, ...5 kvm > g" | DxAr 


topologies 
N 
x exp if drLay +i) buat 
fll 
” e gr (el Dict kuti’), (1.10.2) 


topologies 
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The first quantized description for the N-particle scattering amplitude is 
clumsy because we must explicitly sum over certain topologies, which must be 
put in by hand. This means that unitarity is not obvious in the first quantized 
formalism. Later, we will see that this problem in the first quantized point 
particle theory carries over directly into the first quantized string theory. In the 
second quantized description, however, all topologies can be derived explicitly 
from a single action. 

The transition from a first to a second quantized description is straight- 
forward in the path integral formulation. For example, the propagator can be 
written in either first or second quantized language: 


Xb 
Mar = | Drei fe ath 
x 


“a 


= / Db Dv Wxa)W(xp el OLY, (1.10.3) 
where 
(xlw) = WO), 
L(t) = 4mx?, (1.10.4) 


L(w) = Ww" (id, — Hy. 

The last equation is the Lagrangian for the Schrédinger wave equation, which 
can be derived beginning with the postulates of path integrals and L = tm). 

For the interactions, it is also possible to extract the second quantized vertices 
from the first quantized theory in exactly the same fashion. We simply write 
down the functional integral over a world sheet where the point particle splits 
into other point particles, and then write this Green’s function as a functional 
integral over second quantized fields. 

We will shortly see the advantage of carefully working out the details of 
point particle path integrals. We will find that almost all of this formalism 
carried over directly into the string formalism! 
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CHAPTER 2 


Nambu—Goto Strings 


2.1 Bosonic Strings 


String theory, at first glance, seems divorced from the standard techniques 
developed over the past 50 years for second quantized field theories. This 
is because string theory was first historically discovered as a first quantized 
theory. This is the reason why string theory at times appears to be a random 
collection of arbitrary conventions. Although a second quantized field theory 
can be derived completely froma single action, a first quantized theory requires 
additional assumptions. In particular, the vertices, the choice of interactions, 
and the weights of these perturbation diagrams must be postulated by hand 
and checked to be unitary later. 

Fortunately, the path integral formalism for the first quantized point particle 
has been generalized for the string by J. L. Gervais and B. Sakita, which enables 
us to write down the dynamics of interacting strings with remarkable ease. 

In the previous chapter, we laid the crucial mathematical groundwork for a 
discussion of the first quantized point particle theory. Surprisingly, almost all 
of the main features of the Nambu—Goto string have some form of analogue 
in the first quantized point particle theory. Of course, entirely new features are 
found in the string theory, such as the existence of powerful symmetries on the 
world sheet, but the basic methods of quantization can be carried over directly 
from the point particle case studied in the previous chapter. 

We saw that the usual formulation of second quantized field theory can 
be rewritten in first quantized form. Thus, the traditional covariant Feynman 
propagator (1.6.19) can be written via (1.3.28), (1.3.30), (1.3.37) as 


Ar(%1, x2) = 4 Fam |x2) 
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= ax [ Dre Cfo tee) AUD) 


where we integrate over all possible trajectories of a particle located at x,,(T) 
which start at x, and end at point x). The interactions, we saw, were introduced 
by hand into the theory by postulating a particular set of topologies over which 
this particle can roam. The scattering amplitude, for example, is 


yf ox Aree Laci 1, 


ACK) R22 22a) 


topologies 


N 
De le: (2.19) 


topologies \i=1 
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where we integrate over topologies that form the familiar Feynman diagrams 
for ¢? or ¢* theory. 

It is important to notice that the resulting Feynman diagram is a graph, not a 
manifold. At the interaction point, the local topology is not R”, so it cannot be a 
manifold. There is no correlation between the internal lines and the interaction 
points. This means that we can introduce arbitrarily high spins at the interaction 
point of the first quantized relativistic point particle. Thus, the first quantized 
point particle theory has an infinite degree of arbitrariness, corresponding to the 
different spins and masses we can place at the interaction point. Furthermore, 
the ultraviolet singularities of each Feynman diagram correspond to the number 
of ways we can “pinch” the diagram by shrinking an internal line to zero, thus 
deforming the local topology. 

This picture, however, totally changes with the string. Although the path 
integral formalism looks almost identical, there are profoundly important dit- 
ferences. In particular, the sum over histories becomes a sum over all possible 


FIGURE 2.1. Vertex functions for point particles and strings. A large number of 
point particle theories are possible, based on different spins and isospins, because 
the Feynman diagrams are graphs. Only a few string theories are known, however, 
because the interactions are restricted to be manifolds. not graphs. Conformal sym- 
metry, modular invariance, and supersymmetry place enormous restrictions on the 
manifolds we may use to construct superstring theories that have no counterpart in 
point particle theory. 
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tubes or sheets that we can draw between two different strings (see 
Fig. 2.1). This world sheet, in turn, is a genuine manifold, a Riemann sur- 
face, so the set of interactions consistent with the propagator is severely limited. 
Thus, we expect to find a very small number of string theories, in contrast to the 
infinite number of point particle theories we can write. Furthermore, the super- 
string theory does not suffer from ultraviolet divergences caused by shrinking 
one of the internal lines to zero. You cannot “pinch” the string world sheet to 
obtain an ultraviolet divergence. Thus, string theory is free of ultraviolet diver- 
gences from strictly topological arguments. (We must be careful to point out 
that this pinched diagram, however, can be reinterpreted as an infrared diver- 
gence representing the emission of massless, spin-0 particles into the vacuum. 
Fortunately, supersymmetry eliminates these infrared divergences.) 

In summary, although the path integral formalism can treat both the first 
quantized point particle and the first quantized string theory with relative ease, 
there are profoundly important physical differences between the two theories 
that arise from strictly topological arguments. 

We begin our discussion of strings by first introducing the coordinate of 
a string vibrating in physical space-time. Let the points along the string be 
parametrized by the variable o, and then let the string propagate in time. Let 
the vector 


X (9, T) Mey) 


represent the space-time coordinates of this string (see Fig.2.2) parametrized 
by two variables. When the string moves, it sweeps out a two-dimensional 
surface, which we call the “world sheet.” We will parametrize this world sheet 
with two variables, o and t. The vectors that are tangent to the surface are 
given by the derivatives of the coordinate: 


OX, Oxy 
tangent vectors = i : (2.1.4) 
T 


do 


FIGURE 2.2. The two-dimensional world sheet swept out by a string. When a string, 
which is parametrized by a, moves in space-time, it sweeps out a two-dimensional 
surface parametrized by o and t. The string variable X,,(0, T) is just a vector that 
extends from the origin to a point on this two-dimensional manifold. 
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The contraction of two of these tangent vectors yields a metric: 
VP, Gre), eae (15) 


where we have now replaced the two variables (t, 0) with the set (a, b), where 
a, b can equal either 0 or |. The infinitesimal area on this surface can be written 
simply as 


d Area ~ ,/det |g,,| do dt. (2.1.6) 


In analogy to the point particle case, where the action is the length swept out 
by the point, we now define our action to be the surface area of this world 
sheet. Our Lagrangian is therefore [1—4]: 


1 : 

| ae (Xe Oe, (207) 
where the prime represents o differentiation and the dot represents t differenti- 
ation. (When we discuss M-theory, we will introduce the theory of membranes 
in arbitrary dimensions in the same fashion. But instead of two coordinates 
on a world sheet, we now have (p + 1)-coordinates on a world volume. The 
action for the p-brane is still the same: the square root of the determinant of 
the metric tensor. The action is just the volume of the membrane.) The action 
is just the Lagrangian integrated over the world sheet, which is the total area 
of the two-dimensional surface: 


= [4 adtl(o. 7): (Zales) 


The Green’s function for the propagation of a string from configuration X,, at 
“time” tT, to configuration X;, at “time” 1, as well as the path integral over 
a surface that expresses the topology of several interacting strings. can be 
represented as 


Xp - _ 
KX Ape | DXe~fe tt dol 


VXa 


i | duDXe**, (2.1.9) 


topologies 


where DX = [],,,,, dX,(o. T), du represents the measure of integration over 
the location of the external legs, and where we have made a Wick rotation in 
the t variable (t —> —it) so the integral converges. 

The correspondence between the point particle path integral formalism that 
we carefully developed in the previous chapter and the string formalism is 
quite remarkable. We find that almost the entire point particle formalism can 
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be imported into the string formalism: 


x,(T) X,(, T) 
length = area 
| [@xu@) |] 4X,e. 2) 
[Vise U,o,T 


Similarly, the path integral for the point particle and the string theory have 
surprising similarities. The N-point function for the N-string scattering am- 
plitude can also be written as a Fourier transform, similar to point particle path 
integrals: 


xj ae Xj - : 
/ Drews Loe / DX e700 Sy? Lat 
x, x 
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Although there are remarkable similarities between point particle and string 
theories when expressed in the language of path integrals, the crucial difference 
between them emerges when we analyze the topologies over which the objects 
can move. For the point particle case, the topologies are graphs, as in Feynman 
graphs, whereas the topologies for string theories are manifolds: 


Graphs — Manifolds. 


One of the crucial reasons why there are so many point particle actions (and so 
few string actions) is the difference between graphs and manifolds. The nontriv- 
ial restrictions placed on manifolds severely restrict the number of consistent 
string theories. 

As in the case of the point particle, the choice of parametrization was totally 
arbitrary. Thus, our actions must be reparametrization invariant. To see this, 
let us make an arbitrary change of variables: 


(anc): (2.1.10) 
Under this reparametrization, the string variable changes as 
5X" = X80 + X“6r. Clue) 


Because the area of a surface is independent of the parametrization, the action 
is manifestly reparametrization invariant, which is easily checked. 
As before, let us now write down the canonical conjugates of the theory: 


BE XX 0G) Xx", 
= er 


1 
~ gxe na’ /detlo,X"o,X,| 
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As in the point particle case, these momenta are not all independent. In fact, 
we find two identities that are satisfied by the canonical momenta: 


1 
2 2 
——. XxX" =0, 
oa Qroey * (20012) 


Pix = 0. 


Constraints: 


Thus, the canonical momenta are constrained by these two conditions. If we 
calculate the Hamiltonian of the system, we find that it vanishes identically as 
in (1.4.9): 

H = P,X*—L=0. (Qiua3) 
The vanishing of the Hamiltonian and the presence of constraints among the 
various momenta are indications that the system is a gauge system with an 
infinite redundancy. The reparametrization invariance of the system is the ori- 


gin of this redundancy. We can therefore write down the close correspondence 
between the constraints of the point particle theory and the string theory: 


i ) 

P? + ———X” =0, 
{p?>+m=0} —> Ee (Ose 
PX =0. 

Before we begin a detailed discussion of the quantization of the string. it is 
instructive to investigate the purely classical motions of this string. Let us first 
classically set the parameter t equal to time, so that 

Ope (cl) 
X', = (0, X;). (2.1.14) 
Then let us factor out X” from the action (2.1.7): 
l 9) 4 oe /9 
b= eet, ee (2.1.15) 
27a’ 


where v is the velocity component perpendicular to the string: 
—* Xi. (2.1.16) 


The boundary conditions that we derive from this gauge-fixed action include 
p= 1. (Qaligt7) 


This means that the ends of the classical string travel at the speed of light. 

We can also calculate the energy of the classical string. Let us assume that 
the string is in a configuration that maximizes its angular momentum, i.c., it is 
a rigid rod that rotates with angular velocity « around an axis labeled by the 
unit vector r. The string can be parametrized as 


eee 88 (2a0ah8) 
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where 
b= wr 
F-w= il, (2.1.19) 


and —! <o <1. 
To calculate the energy and the angular momentum of the system, we must 
first write down the Lorentz generators associated with the string: 


Me — / do(P’X" — P#x’y., (2.1.20) 


Notice that this generates the algebra of the Lorentz group if we impose the 
Poisson brackets: 


[Xu(o), Pr(o’)] = nuvd(o — 0”). i211) 


We can now calculate the energy and the angular momentum from the 
components of the Lorentz generator [5]: 


t 
E= ra aol o0-) 2 = 7 
1 ; a 
= mee . ai =a x(a x 
= w(4a'a*)! 
~ WE a’. (2.1.22) 


Thus, the angular momentum of the rotating string is proportional to the square 
of the energy of the system: 


|J| ~ E?. 1-23) 


If we plot the energy squared on the x-axis and the angular momentum on 
the y-axis, then we obtain a curve called the Regge trajectory. The slope of the 
Regge trajectory is given by a’ and the curve is linear. Thus, we have obtained 
the leading Regge trajectory for the classical motion of a rigid rotator. We 
will, throughout this book, take the normalization w’ = }. This is an arbitrary 
convention. However, we will see later that the intercept ao of the leading 
trajectory must be equal to one, which is fixed by conformal invariance once 
we quantize the theory. Thus, we set 


/ 


VS 


eS wi 


ay = (2.1.24) 


When we quantize the system, we will find that there is an infinite number 
of such parallel Regge trajectories, but with increasingly negative y-intercepts. 
' As we have stressed, there is a crucial difference between the point particle 
case and the string, which is that the string system has a larger set of con- 
straints that generate the gauge group of reparametrizations. For example, if 
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we introduce the canonical Poisson brackets, then we can explicitly show that 
the constraints generate an algebra. 

To calculate this algebra, we decompose the X and P in terms of normal 
modes for the open string: 


“1 
XO) eer ee y — X! cosno, 
n=] vn 


1 co 
P#(o) = — { p*# + y Jn P# cosno > . (21.25) 

1 
i 
Notice that the X decomposition is given strictly in terms of cosines. This is 
because, when we calculate the equations of motion of the string, we must 
integrate by parts and hence obtain unwanted surface terms ato = 7m and 
o = 0. In order to eliminate these surface terms, we must impose 

ee 0 (2.1.26) 


m 


at the boundary. This boundary condition eliminates all sine modes of the 
string. (These are called Neumann boundary conditions. It was once thought 
that Dirichlet boundary conditions, where the derivative of the string variable 
is a nonzero constant, lead to unphysical strings, since the endpoints are now 
fixed in space and time, terminating on a stationary hyperplane. When we 
discuss M-theory, however, we will find that these hyperplanes can actually 
move in space and time, oscillating like membranes (called D-branes). These 
D-branes, we will see, are essential in understanding the solitonlike solutions 
of string theory.) 

We will sometimes find it convenient to take advantage of this form of the 
expansion. In particular, it means that 


X (0) = Xul—-0), 

X_,(a) = —X,(—o). (2ele27) 
The same applies for the modes of the canonical conjugate P,,. This, in turn, 
allows us to combine both constraints into one, using the properties of the 


string under reflection from o into —o. If we let the string parametrization 
length be 2 we can define 


opt xX’ \?2 

Ly =— | dof(c) (via, + — ) (2.1.28) 
(~ an ie. 

where f(o) is an arbitrary function defined from —: to 2. Notice that both 

constraints are now combined into one equation because of this reflection 

symmetry. Using (2.1.21), we can show that these generators form a closed 

algebra: 


eet | 


[Ly, Le] = Lise, (2.1.29) 
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where 
i xg= fe’ — ef’. (2.1.30) 
It is also possible to show that this algebra satisfies the Jacobi identities: 
Retr: (oe, eal 0, Cet) 


where the brackets represent all possible cyclic symmetrizations. This algebra 
is called the Virasoro algebra [6], which will turn out to be one of the most 
powerful tools we have in constructing the string theory. 
As in (1.4.11), we can elevate the constraints into the action with Lagrange 
multipliers A(o, tT) and p(o, T): 
; xX? 
L=P,X"+27a')| P? +—*— Pex 271032 
# E  Qaa'y pag ( ) 
By functionally integrating out over these Lagrange multipliers, we arrive at 
the previous set of constraints. Not surprisingly, this new action has its own 
reparametrization group parametrized by n and ¢: 


6X, = 2na'eP, +X’ 


be? 


eae 
Oy ae nPa | 33) 


27a’ 

dA = —E4+ANn-—nNi4+ p'e — pe’, 

69 =—n+Ne—de'+pn—7np. 
The advantage of this form for the action is that it is first-order and does not 
have the bothersome square roots of the original action. As in the point particle 
case, this indicates that there exists yet one more form for the action, expressed 
in terms of an auxiliary field. To find this third formulation of the action, let 
us introduce a new independent field 


~ Gup(G, T) (2.1.34) 


which represents a metric on a two-dimensional surface. Unlike our previous 
discussion, this metric is now totally independent of the string variable. Let us 
write down the Polyakov form of the action [7] (g = | det g,,|): 


i 
L=-——-. or Oy Made. Pad ee) 
tra ¥88 ub ( ) 


This is a generalization of the second-order point particle action (1.4.14). No- 
tice that the Polyakov action resembles an action with scalar fields interacting 
with an external two-dimensional gravitational field. This action, too, possesses 
manifest reparametrization invariance: 

One) == ag ae 

dg” = 0-2” — 20,2" — 9” 0,6", (2.1.36) 


b,/2 =a, (© -/8). 
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The action is also trivially invariant under Wey] rescaling: 
5g%? = Ag”. (2.1.37) 


The Polyakov action is entirely equivalent at the classical level to the earlier 
Nambu—Goto action. As in the Nambu—Goto formalism, we can derive the 
Virasoro algebra. By varying with respect to the metric tensor, we obtain the 
energy—momentum tensor, which we can set to zero: 


roe = mew 5get (2.1 38) 
Working this out explicitly, we find 
Tap = Oy Xi0pX" 15 Oahe Ook O2%,,- (2.1.39) 


The moments of the energymomentum tensor will correspond to the Virasoro 
generators. Thus, we have another way of deriving the Virasoro generators from 
this new but equivalent formalism. 

Notice that the metric field g,,, is not a propagating field. The metric tensor 
does not have any derivatives acting on it. Thus, we can eliminate it via its own 
equations of motion. This leads us to 


BL 2a, X yOpX™ 


—0 a ab — . 
b8ab gee g°40.X 07x" 


Substituting this value of the metric tensor back into the action, we rederive 
the original Nambu—Goto action. Thus, at the classical level, the two actions 
are identical. 

In summary, as in the point particle case, we now have three different ways in 
which to write down the action, all of which are equivalent classically. Each has 
its own particular advantages and disadvantages when we make the transition 
to the quantum system. These string equations are direct generalizations of 
the three point particle Lagrangians found in (1.4.16). As before, we have the 
second-order formalism, which is expressed in terms of the string variable X,, 
as well as the metric tensor g,,,; the nonlinear formalism, which is expressed 
entirely in terms of X,,, and the Hamiltonian formalism where we have X,, 
and its canonical conjugate P,, (or the pair 0,,X,, and P*“): 


(2.1.40) 
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first-order (Hamiltonian) form: L = Rex M4 omra’dr lay == 


=| 
second-order form: L = Teg V8" 8a X A,X", (2.1.41) 
a’ 


P | "9 5 2 7 / 
nonlinear form: L = Sagi euXe =(Xpx "yy. 
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At first, we suspect that these actions are totally equivalent, so that we can 
choose one and drop the others. This is apparently not so, for two subtle reasons: 


(1) Because we are dealing with a first quantized theory, we have to take the 
sum over all interacting topologies that are swept out by the string. For the 
Nambu-—Goto string, the precise nature of these topologies is ambiguous 
and must be specified by hand. However, for the Polyakov form of the 
action, which contains an independent metric tensor, we can eliminate 
most of this ambiguity by specifying that we sum over all conformally 
and modular inequivalent configurations. (These terms will be defined 
later.) This will become a powerful constraint once we start to derive 
loops and will determine the function measure uniquely. The measure and 
the topologies in the Nambu—Goto action, however, are not well defined. 
(We must point out, however, that this rule of integrating over inequivalent 
surfaces does not automatically satisfy unitarity. This still must be checked 
by hand.) 

(2) The gauge fixing of Wey] invariance for the Polyakov action, although triv- 
ial classically, poses problems when we make the transition to quantum 
mechanics. An anomaly appears when we carefully begin the quantiza- 
tion process. In fact, this conformal anomaly will disappear only in 26 
dimensions! 


Let us now discuss the quantization of the string action. The strategy we will 
take in quantizing the free theory to obtain the physical Hilbert space will be 
first to extract the symmetry of the action, then the currents, and then the algebra 
formed by the generators of this symmetry. (For the string, the symmetry will 
be reparametrization invariance and the algebra will be the Virasoro algebra.) 
Then we must apply the constraints onto the Hilbert space, which eliminates 
the ghosts and creates a unitary theory. It is important to keep this strategy in 
mind as we begin the quantization of the string. 


Action — Symmetry —> Current — Algebra — Constraints — Unitarity. 


As inthe point particle case, we can begin the quantization program in several 
ways. There are three formalisms in which to fix the gauge of the theory: 
(1) Gupta—Bleuler (conformal gauge), (2) light cone gauge, and (3) BRST 
formalism. The advantages and disadvantages of each area as follows: 


(1) The Gupta—Bleuler is perhaps the simplest of the three formalisms. We 
allow ghosts to appear in the action, which permits us to maintain manifest 
Lorentz invariance. The price we must pay, however, is that we must impose 
ghost-killing constraints on the Hilbert space. Projection operators must 
be inserted in all propagators. For trees, this is trivial. For higher loops, 
however, this is exceedingly difficult. 

(2) The advantage of the light cone gauge formalism is that it is explicitly 
ghost-free in the action as well as the Hilbert space. There are no com- 
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plications when going to loops. However, the formalism is very awkward 
and Lorentz invariance must be checked at each step of the way. 

(3) The BRST formalism combines the best features of the previous two for- 
malisms. It is manifestly covariant, like the Gupta—Bleuler formalism, and 
it is unitary, like the light cone formalism, because the negative metric 
ghosts cancel against the Faddeev—Popov ghosts. 


Let us now discuss each quantization scheme separately. 


2.2 Gupta—Bleuler Quantization 


The Gupta—Bleuler formalism will maintain Lorentz invariance by imposing 
the Virasoro constraints on the state vectors of the theory: 


(@| Ly ly) =09, OD)1) 


where (| and |y) represent states of the theory. This constraint will eliminate 
ghosts in the state vectors, allowing us to keep nonphysical negative metric 
ghosts intact in the action. 

Classically, the metric tensor has three degrees of freedom that we can 
gauge away, two arising from reparametrization invariance and one from Wey! 
invariance. Since the metric tensor has three degrees of freedom, we can gauge 
all of its components away: 


=$ = :) 272 
Sab = Sab = 0 1 222) 


which we call the conformal gauge. (There are complications. as we have said, 
in taking the conformal gauge for the quantum theory and for higher loops.) 
Our action reduces to 


Ara’ 


This is exceptionally simple because the action now corresponds to an 
uncoupled free string. This action yields the free equations of motion: 


1 sf? - } 
——— do f arx? — X?), (233) 
0 


(a | OR 7 
02 = ar. X (a, 7 a0 (2.2.4) 
with the boundary condition: 
X (0, t) = X_,(2, tr) =0 (223) 


which we nced to enforce when we integrate by parts and eliminate the surface 
term. The solutions of the equations of motion are arbitrary functions of 0 +t 
and o —T: 


X"(o,t) = X}(o +1) + XZ (0 — 7). (2.2.6) 
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The canonical commutation relations are now 
[P, (0), X(o")] = —iny,8(o — 0’), (2.2.7) 


where 


5(o —o') = it +2 > ‘cos no cosno ). (228) 
n=l 

There is, of course, an infinite number of possible representations of the path 

integral. However, as in the point particle case, we can always choose the 

simplest one, the harmonic oscillator basis [8], where the Hamiltonian becomes 

diagonal. Unlike the point particle case, however, we now have an infinite 

number of oscillators, one set for each normal mode: 


X# = siv2a/(at — att), 
| 

jaNe es 

VJ 2a! 


where we can satisfy the canonical commutation relations if we set 


(at +a",), (2.2.9) 


[Gres Qh jl tOnpr ie (2.2.10) 
It is also conventional to introduce an equivalent set of oscillators: 
= id m> 0, 
a, = mal", m > 0, 
X*(o, tT) = x" + 2a’ p*t + iv2a’\° a ete cosno. (2.2.11) 
n#0 


Written in this basis, the Hamiltonian takes on an especially simple form (see 
(3.37)): 


H= if do(P,,X" — L) 


1 
Pp; Xi )d 
ip Chae a) : 


=O 
fo?) 
= oy na} at + ot’ oe Qo, =v 2a Di, (2.2.12) 


where we have made an infinite shift in the zero point energy. At this point, 
the mass of the lowest-order particle is not well defined because we made 
this infinite shift, but we will later show that this lowest particle is actually a 
tachyon. We will show that the intercept of the model is fixed at 1. 

‘ Notice that each oscillator mode is basically uncoupled from the other oscil- 
lator modes. In fact, the Hamiltonian is diagonal in the Fock space of harmonic 
oscillator excitations. Taking this specific representation of the string function 
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FIGURE 2.3. Regge trajectories for the open string. The x-axis corresponds to the 
energy squared and the y-axis to the spin. The particle farthest to the left is the 
tachyon, which corresponds to the vacuum of the Fock space. The massless spin-1 
particle is the Maxwell or Yang—Mills field, which corresponds to a single creation 
operator acting on the vacuum. There is an infinite number of Regge trajectories. 
corresponding to the infinite excitations of a relativistic string or the infinite number 
of states in the Fock space. 


from the infinite number of possibilities is a great advantage because the al- 
lowed eigenstates of our Hamiltonian are now simply the products of the Fock 
spaces of all possible harmonic oscillators: 


eigenstates: | [i@tt 10), (22513) 
ni 
where the vacuum is defined as 
G0), n> 0. (2.2.14) 
The spectrum of the lower lying states can be categorized as (see Fig. 2.3): 
tachyon — |0), 
- 
massless vector > a,“ |0) , 
massless scalar > k,,a;" |0) , (2.2.15) 
- ; fu ty 
massive spin-2 — a,"a," |0), 
. t 
massive vector — a," |0). 

(The fact that the string theory is so simple to quantize can be traced to 
the fundamental fact that the Hamiltonian is quadratic in the string variables; 
this means that it decomposes into an infinite series of free particles. A vast 
collection of vacuum solutions to string theory can be constructed because, in 
essence, it is a free theory. However, this simplicity breaks down completely 
when we analyze membranes in M-theory. We will find that the Hamiltonian 
is now quartic, making the quantization intractable. In contrast to the simplic- 
ity of string theory, there is still no satisfactory method for quantizing free 
membranes. ) 


As expected, we recover the leading Regge trajectory that we obtained earlier 
from classical arguments, and also an infinite number of daughter trajectories 
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that have an increasingly negative y-intercept. In this gauge, the propagator or 
Green’s function takes on a simple form: 


K(a, b) = (Xq|e~"** |X,) 


x 
es ag ae 
=| Dx exp f ao dr | 0x ~ xi}. (2.2.16) 
where 


DX =| [|] [¢x*@ =] ax". (2.17) 
Yo wn 


We must be careful to note that the functional integral over DX is an infinite 
product of integrals over each point along the string, or each Fourier mode of 
the string. 

Fortunately, this integration can be carried out explicitly. Let the particle 
move from tT = 0 to t = oo. Because the Hamiltonian is diagonal on the 
space of harmonic oscillators, we can perform the integration over t exactly. 
We find 


xp e (2.2.18) 


oO 
Xa dte—# X,) = xe 
(xl f IX») = (Xel -— 
where Ly is the zeroth Fourier component of the Virasoro generators (2.1.28). 
In other words, our propagator for the free theory is 


l 


D= 
Lo -—1 


(2.2.19) 


sandwiched between states of the string. However, because we have the identity 


|X) i! DX ({X|=1 (2.2.20) 


we can remove the path integrals explicitly at every intermediate point between 
the initial and the final state. In fact, because of the simplicity of the N-point 
function, we will be able to remove all functional integrations over string states 
explicitly, leaving only the harmonic oscillators. It is important to stress again 
that the harmonic oscillator formalism is nothing but a specific representation 
of the path integral. Its simplicity is a consequence of the Hamiltonian being 
diagonal in the Fock space of harmonic oscillator states. 

In much the same way, the closed string can also be written in terms of 
harmonic oscillators. For the closed string, there is no X’ = 0 boundary con- 
dition, and we can use both sine and cosine functions to decompose its normal 
modes. Thus, we expect twice the number of oscillators for the closed string. 
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The oscillator decomposition is ie by 


A) = Xy + (La) ye Lo =(dne —ino + anes es aa as ques 72), 
(2231) 
Pu —ino ia gine t+ ino ~t —ing 
PGS /n(—iege" — ta + iave'"’+ iape'"’).u 
4 Qn” 2nV/2a! * a ~ 


The Hamiltonian for the closed string is 


Z see 
n= d ae 
nf 2) od ai aaah 


= na\a, + naan) + a Bae (22.22) 


Again, the Fock space consists of all elements created out of harmonic oscil- 
lators, but this time there is an extra constraint that is not found for the open 
string: 


(Lo — Lo) |¢) = 0. (2.2.23) 


(The interpretation of this constraint is that the closed string must be indepen- 
dent of the origin of the o-coordinate. For example, the operator [ dae’! 
can be interpreted in two ways. First, it generates rotations in o-space, so we 
average over a rotation of 27 in o-space. Second, if we perform the integral, 
we have 6(Lo — beg which is constraint (2.2.23) when applied to the Hilbert 
space. We will return to this constraint later.) 

The Fock space consists of (see Fig. 2.4): 


tachyon — |0), 


massless spin-2 > a}“a'" |0) , (2.2.24) 


|OD- . Kakgaltta 0m 
FIGURE 2.4. Regge trajectories for the closed string. The Fork space is built out 
of two commuting sets of harmonic oscillators. The massless spin-2 particle is the 
graviton, which corresponds to the product of both types of operators acting on the 
vacuum. 
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massless scalar > k, kala’ |0) . 


Notice that a massless spin-2 particle occurs in the spectrum of the closed 
string. When the string model was first being interpreted as a model of hadrons, 
the presence of this gravitonlike spin-2 particle was a great embarrassment. 
Attempts were made to associate this particle with the Pomeron trajectory 
found in S-matrix theory. It can be shown that, when we generalize to trees 
and loops, this massless spin-2 particle has a gauge invariance equivalent to 
the graviton of Einstein’s theory. By dropping the earlier interpretation of the 
string theory as a model of hadrons, we find a natural place for this gravitonlike 
object as the graviton itself. 

In summary, the Gupta—Bleuler formalism in the conformal gauge appears 
simple and elegant, mainly because we are allowing ghosts to appear through- 
out the action. The theory reduces to the simplest possible string theory, a free 
propagating string. 

The price we must pay for this simplicity, however, is the imposition of the 
constraints on the Fock space. We can write the Virasoro generators as [6]: 


/ 2 
7 iia : xX 
is ee d ino | p (id 
4 ia i =] 


ii lo @) 
5 a On—m,por, (22.25) 


2 
m=—0o 


t 


oO 
ose M (Wa 
le ) np, + 5M. 
n=] 


The physical states of the theory must therefore satisfy 
Lp |p) = 0, Ana> 0, 
(Lo — 1)|¢) = 9. (2.2.26) 


The algebra generated by these operators is 


D 
[Lays Lm] =(n—m)Laim + ms One (2207) 


where D is the dimension of space-time. The fact that there is a c-number 
central term appearing in this equation at first sounds surprising, but this can 
be calculated explicitly by taking the vacuum expectation product of a simple 
commutator: 

(0|[L2, L-2]10) = >. (2.2.28) 
The origin of this central term is that we normal ordered the generator Lo to 
obtain finite matrix elements. This normal ordering was even found in the point 
particle case in (1.8.7), except now the energy shift is infinite. The price we pay 
for finite matrix elements is that locality ino is lost. The normal ordering spoils 
the fact that the generators L ¢ were originally local functions ina. The process 
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of quantization necessarily destroys the locality of Virasoro generators in the 
variable a , and hence the c-number central term occurs. Thus, the quantization 
scheme and regularization scheme used to extract finite information from the 
model are actually inconsistent with conformal symmetry. Fortunately, this 
inconsistency can be eliminated if we fix the dimension of space-time to be 
26. 

That there are ghosts in the theory is due to the fact that the zeroth component 
of the harmonic oscillator has negative metric: 


ghost = {a! 5} 0). (2.2.29) 


Thus, the coefficient of the Green’s function occurs with a negative sign. In 
addition, there are zero norm states and negative norm states that must be taken 
into account. 

To analyze the spectrum, let us define a spurious state |S) to be one that is 
orthogonal to all physical real states |R). Spurious states can be written as 


|S) = L_, |x), n> 0, 
Vids) =O) 
for some integer n and some state |x). (If we take the matrix element of this 
state with a physical state, the scalar product always vanishes because L,, 
destroys a physical state.) Now let us construct the spurious state: 
lv) =[L_2 + aL?,)|¢). (2.2.30) 


We do not want this state to be part of the physical Hilbert space. However, let 
us see the conditions under which it might be part of the physical spectrum. 
Let us set 


Li |v) = La|w) =0. 2231) 
This fixes the following: 
3—2a =0, iD-—4-6a=0, (2.2.32) 
which, in turn, fixes 
D265 (2.2.33) 


Fock space. At first, this seems disastrous. We want our physical Hilbert space 
to be ghost-tree. But notice that in 26 dimensions this state has zero norm (not 
negative norm). Since |) was arbitrary, we have constructed an infinite class of 
states |) that are simultaneously spurious and physical. If we take the norm of 
this higher-order state, we find that it also vanishes in 26 dimensions. making 
it a null spurious state. This state is still acceptable because the norm of the 
state is nonnegative. Thus, in 26 dimensions we have an acceptable spectrum 
for this set of states. 

Similar analyses of the state vectors of the theory show that a physical state 
|?) can be constructed which actually has negative norm if D is greater than 
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26: 
|) = (aa?, + bk -a_2 + c(k -a@_1)*) |0). 


Imposing L;|9) = L,|@) = 0, we find that b = a(D — 1)/5 andc = 
a(D + 4)/10. We find that the norm of the state is | 


(010) = 2a(D — 1)(26 — D). 


Thus, the dimension of space-time cannot exceed 26 or else negative norm 
states exist as part of the physical states. In general, we find that the spectrum 
is ghost-free if the dimension of space-time is less than or equal to 26: 


@=l, D=26: | 


i pee (2.2.34) 


ghost-free: | 

This exercise was done only for a piece of the Fock space. But can ghosts 

be eliminated to all orders in the string model? We will return to the difficult 

problem of ghost elimination in the Gupta—Bleuler formalism at the end of this 

chapter, when we actually construct the physical Hilbert space and show that 
it has no negative norm states in 26 dimensions. 


2.3 Light Cone Quantization 


Choosing the light cone gauge, where all unphysical degrees of freedom are 
explicitly removed from the very beginning, is possible because we have two 
gauge degrees of freedom, and hence two gauge-fixing conditions can be in- 
serted into our path integral. One of these gauge-fixing constraints can be the 
elimination of nonphysical modes from the Hilbert space, as in the Coulomb 
gauge. Thus the elimination of ghost states, which is quite involved for the 
Gupta—Bleuler formalism (as we shall see at the end of this chapter), becomes 
trivial in the light cone gauge. 
Let us choose the notation 


1 
Xt= me ee | 
1 


hers XP aN 2.3.1 
Fa! ] ( ) 


xX = 
then 
Ae Be AD Ae. (ean) 


Depending on which version of the action in (2.1.41) we use, we will have 
different gauge constraints. If we start with the original Nambu—Goto action, 
, for example, the gauge conditions in the path integral are 


Z= / DX MApp | [ 8(X*(a) — pt 1)8(X), + Xj — 2X, Xe, (2.3.3) 


68 2. Nambu—Goto Strings 


where M is a measure term that must be added to have a unitary theory, and 
the two delta-functions represent the gauge-fixing constraints. The remark- 
able feature of the second constraint is that the Nambu—Goto action, which is 
expressed as a highly nonlinear square root, completely linearizes [12] 


yf X2.X2 — (KX ~ 3X2 — X?). (2.3.4) 


(Because the light cone action is no longer a square root, we have a well- 
behaved action that can be canonically quantized.) 

The constraint X* = p*t means that the o dependence within X* has 
completely disappeared and that the “time” t now beats in synchronism with 
X+. We can use the second constraint, in turn, to eliminate the X~ modes, and 
hence all longitudinal modes have completely disappeared. The action can now 
be expressed totally in terms of transverse ones. 

Next, we will solve the constraints in the first action in (2.1.41). Let us 
integrate over the Lagrangian multipliers o and A in the Hamiltonian form of 
the action, and then impose the gauge-fixing constraints: 


De / DXDP [see — ptr)s (P+) = o) 


xX 
x 6 (» Hu *) 5(P,x'")e~, (2.3.5) 


Because the covariant Hamiltonian (2.1.13) is equal to zero, the only term in 
the Lagrangian is P,, xX" (the X~ term drops out): 


he / do P,X" = i do(P,;X' — p* P-(o)). (2.3.6) 
0 0 


There are several remarkable features to this formalism. First, we can apply 
four, not two, constraints onto our Hilbert space, two from gauge fixing and 
two by integrating over A and p. Second, because the covariant Hamiltonian is 
equal to zero, the action only consists of P,, X", but the light cone Hamiltonian 
emerges out of the decomposition of (2.3.6): 


I p* | do P(o). (233) 
0 
On the other hand, we can solve the constraint for P~: 
nt see 
P-(o) = — (?: + *) ; (2.3.8) 
ZpT bee 


Plugging the value for P~ into the definition of the light cone Hamiltonian 


(2.3.7), we now have 
MN ay ox xe . 
oH = e ' Nigam a) do (2.3.9) 
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which is the just the Hamiltonian (2.2.12) defined over physical transverse 
states. 


Similarly, we can also eliminate the X~ modes by solving another constraint: 
Lax = 0 (2.3.10) 
which can be solved for X~, yielding 


° 1 
X (a) =|) (eee aie 1k Cesk) 
0 Pp 
Putting everything together, our functional now becomes 
2 / DX, DPyel ft ftehiXi-H) (2.3.12) 


where H is the light cone Hamiltonian density. The great advantage of the light 
cone gauges is that the Virasoro constraints have been explicitly solved, so there 
is no need to impose them on states. All + modes have been gauged away from 
the start, and the — modes have been eliminated in terms of transverse states 
because the Virasoro conditions have been solved exactly through (2.3.8) and 
(2.3.11). Instead of imposing the Virasoro constraints on the Hilbert space, we 
simply solve them exactly and eliminate the — modes. 

However, the great disadvantage of the formalism is that we must tediously 
check for Lorentz invariance at each step of the calculation. Normally, the 
generators of the Lorentz group are given by 


mn = f dole — x7 P4] 
0 
a 
= x" Taam lis ~ 0” ai#}. (2.3.13) 


It is easy to check from (2.2.7) that this satisfies the correct commutation 
relations for the Lorentz group: 
[MY’, M°?] = int®*M'P +.--, (2.3.14) 


However, Lorentz invariance has to be checked once again in light of the fact 
that we have explicitly all ghost modes. Most of the commutators are trivial to 
check, because they are linear. The troublesome term comes from X~, which 
is highly nonlinear and is written as 


Kee te SS —a,e '" cosno, (27375) 
n#0 
where 
1 D-2 ie.) 
_—— reg foto = 206, 0] (2.3.16) 


a, 2 pt 


(=1 m=— 
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and a is the intercept. All commutators are easily calculated, except for the 
one involving X~. The calculation, which is lengthy and must be performed 
carefully, yields the final answer: 


; : =p , 2 ij 
(M-, M-i] = =a Dana — al, JAn, (2319) 
where 

n 1 [| D— 26 
= —(26—D)4+- 2—2a}|. 23218 
| 7 )+ : | oo a| ( ) 

For this commutator to vanish, we must have 
Da DG, (4 ooest (2.3.19) 


This fixes both the dimension of space-time D and the intercept a for the 
model. 


2.4 BRST Quantization 


As in the point particle case, the BRST quantization method starts with the 
Faddeev—Popov quantization prescription and then extracts out a new nilpotent 
symmetry operator. The action is invariant under a reparametrization (2.1.36): 


bd Lub = BacOpOVv° a dub V° Zep oa BabO-dV° 
= Vb Vp ae VpdVq (2.4.1) 


(the second line has been written covariantly and contains two-dimensional 
Christoffel symbols). This allows us to impose the gauge constraint: 


Sab = ops (2.4.2) 
This, in turn, generates a Faddeev—Popov determinant: 
Arp = det(V,) = det(V-) det(V:). (2.4.3) 


This determinant can be calculated by exponentiating it into the action. To 
do this, we must introduce anticommuting ghost fields / and ¢ (see (1.6.22)). 
As before, these ghost fields make it possible to place the determinant of a 
derivative in the exponential. Let us define z = t + io; then: 
J ae 
L=— (50.X,0:X" + bd:c + bd-¢). (2.4.4) 
Ne 


Notice that this action is invariant under 
dX, = é[cd:X,, + cd:X,,], 
jei=nehaaec |. 
OG = Elieore |, (2.4.5) 
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5b = e[cd,b + 28,cb — $9.X,,0,X"], 
bb = e[20;b + 20;¢b — 10:X,,0:X"]. 


From this variation and (1.9.8), we can extract out the nilpotent BRST op- 
erator Q. However, it is also important to note that in general, given any 
Lie algebra with commutation relations [A,, An] = f,2,Ap, it is possible to 
construct a nilpotent operator Q [10] out of anticommuting operators c,, and 
Bey: 


Ge syncs, na eal (2.4.6) 
where 
{Cn, bin} = Onan (2.4.7) 


Thus, our nilpotent BRST operator can be written in this form: 


co 
Oey ce. (L* + ALE — 46,0): 
n=-—& 


co 


oS 1 
= Co(Lo ca a) as leaks aE L-nCn] = 2 y eon onUnam On _ n), 
(2.4.8) 


i n,m=—0o 
where L* equals the X-dependent Virasoro generator, and L® is the ghost 
contribution to the generator. At this point, there are two unspecified parameters 
in the above equation, the value of the intercept a and the dimension of space— 
time. Let us calculate the square of Q, which should be zero: 
Q* = E 3 (Fem Seren — Isr ye eam Nance (24,9) 
3, i 12 6 Mie th 

For this to vanish, we must fix the dimension of space-time to be 26 and the 
intercept to be equal to 1. 

As in the point particle case, we find that the physical states of the theory 
are given by 


Q |phy) = 0. (2.4.10) 
When we separate out the modes, we find that the lowest state satisfies 
(Lo — 1) 16) = 9, 
Lil?) = 9, (2.4.11) 


as before. 
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2 lrees 


So far we have quantized only free strings. The interaction of strings can be 
determined either through functional methods or through harmonic oscillator 
methods. Of the two, functional methods are by far the more powerful. In fact, 
we can view the harmonic oscillator method as just one specific representation 
of the functional method. For trees and even the first loop, oscillator methods 
provide a fast and convenient method of calculating the amplitudes, but for 
higher loops the oscillator method rapidly becomes impractical. The functional 
method, because of its vast versatility, can always be used either to derive the 
oscillator method for trees and the first loop, or to derive all loops. 

Historically, the dual model was first discovered accidentally as a quantum 
amplitude for the scattering of tachyons. These amplitudes possessed a remark- 
able property called duality, meaning that they could be factorized in terms of 
s-channel poles or t-channel poles. (This duality between s and ¢ channels is 
entirely different from the duality that we will find in M-theory.) Usually, in 
the theory of point particles, the Feynman diagrams sum the s- and r-channel 
poles separately (see Fig. 2.5), but the dual amplitudes already possessed poles 
in both channels for the s- and t-channel poles separately. For more complicated 
diagrams, we see that the same N-point amplitude could be factorized in any 
number of ways. Because the counting of these diagrams was not the usual 
counting of Feynman diagrams, for years it was erroneously believed that a 
true field theory interpretation of the dual model was not possible. 

Following the analogy with the point particle case, let us begin by defining 
the N-point scattering amplitude for tachyon scattering [11-13]: 


A, (ki Jame, hyve J aun f Dean f ox 


topologies 


N 
x Arp exp ff naoas [T1va J/g exp(ik; 5} 
| Pen f an I / Sepa ex”) ) (2.5.1) 


topologies 


This expression will be the fundamental path integral from which we will 
derive the theory of interacting strings. /t will be the single most important 
formula in the first quantized formulation. 

In the expression, momentum 4; from the ith tachyon flows into the boundary 
of the surface at a point <;. The string variable X/' is defined at that ith point 
<; Onithe boundary where momentum k; flows into the diagram. The measure 
of integration dj is an integral over the various z;, which we will determine 
shortly. 
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FIGURE 2.5. Duality of the Veneziano model. The four-point amplitude can be 
decomposed in terms of either s-channel or t-channel poles. This is in contrast to 
standard point particle field theory, which sums over both s- and t-channel poles. 
This property of duality extends to the N-point function. For this reason, it was once 
thought that a field theory of strings was impossible. A field theory of strings would 
be plagued by overcounting, especially at higher orders. 


This expression simplifies considerably if we take the conformal gauge. In 
particular, we get 


= 
[an [ oxo =e ~ [0 Dax) dea 21 hay Xt 


i=l 


= pe) du (Ie ) (2.5.2) 


a 


An 


In Fig. 2.6 we show how to simplify the string interaction diagram. By letting 
the string interaction length to go to zero for the external tachyons, we see that 
the string interaction surface can be reduced to an infinite horizontal strip in 
the complex plane, extending from o = 0 too = 7 and t = +00. 

Notice that the action, although it is no longer reparametrization invari- 
ant because we have fixed g,, = 5,,, 1s still conformally invariant. Thus, to 
avoid overcounting of conformally equivalent surfaces, we will take the set of 
topologies over which we must sum to be the set of all conformally inequivalent 

two-dimensional complex surfaces. 
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FIGURE 2.6. Conformal surfaces for open string propagation. In the p-plane. the 
surface over which a string propagates is a horizontal strip of width x. The wavy 
lines at the bottom corresponds to “zero width” strings or external tachyons. In the 
z-plane, the surface becomes the upper half complex plane. The mapping from one 
surface to the other is given by the exponential. 


Let the world sheet of the N-point tree be a horizontal strip of width z that 
extends horizontally in the complex plane. The x-axis corresponds to t and the 
y-axis corresponds to o. Let us now change coordinates to complex variables: 


rae ea de (2.55) 


This mapping takes this infinite strip, which describes the world sheet of the 
interacting string (with zero width tachyons), into the upper half of the complex 
plane. 

Fortunately, the functional integral is a Gaussian that can be evaluated with 
the identities presented in the previous chapter. Let us define Vecjeqr as the 
solution to the classical equations of motion: 


V?Xict = —2ina' J,,, (2.5.4) 
where 
N 
In) =) kin S(2 — 2), (25.5) 
t= 


where ¢; are points on the real axis of the complex plane that correspond to the 
external zero width tachyons interacting with the string. After a Wick rotation 


ALS) WESS iS 


in the t variable, this is just Poisson’s equation for electrostatics. To solve this, 
we need the Green’s function: 


V’G(z, 2’) = 27 8(z — 2’). (2.5.6) 


We must calculate the Green’s function, with Neumann boundary conditions, 
for the upper half-plane. The easiest way to calculate this is to borrow a trick 
from the theory of electrostatics, namely the method of images. Let us place 
a point charge at the point z’ in the upper half-plane. Consider another point 
charge at the point 2’ that is symmetrically reflected through the x-axis; Z’ 
is in the lower half-plane. If we are sitting on a point charge z in the upper 
half-plane, then the potential at that point is proportional to 


Gz, z') = In|z —z'| + In|z— 2. Osa 


Notice that if we are sitting on the x-axis so that z is real, then the derivative 
of the Green’s function normal to the x-axis is zero. Thus, these boundary 
conditions are precisely what we want, so (by the uniqueness theorem) this 1s 
the Green’s function for the upper half-plane. 

We can now insert this Green’s function back into the integral. The classical 
value of X that solves (2.5.4) is 


a ia’ | Gia ) dz. (2.5.8) 


Let us now make a shift in the integration variable: 
Xp > XpatXz- (25.9) 


We find, therefore, that the functional integrals can be performed using (1.7.10): 


1 ee , 
[ oxe®|-2 _f axyaextacti f .xtde| 
at 


exp {5 i J,(2)G(z, 2)J*(2) dz a:' 
| [exp {a’ky - k; In |z; — z;\} 
i iFj 
=[]la-2"™, (2.5.10) 


Lay 


II 


where 


1a — Se eke 
j 
Putting everything together in (2.5.11), we find 
Ay = few NE eae eee (25.11) 


1<i<j<N 
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(Notice that we have explicitly removed the “self-energy” term fori = j, 
which would be divergent. We can truncate the integral and still maintain the 
conformal properties of the theory. We will find that this truncation will also 
have to be performed in the harmonic oscillator method.) 

Now we must complete the last step, which is to fix the measure dj. 

Our first guess is that, when the amplitude is expressed in terms of z;, the 
measure is simply equal to one. This is the correct choice that is compatible 
with conformal invariance. To prove it, recall that earlier we said we must 
sum over all conformally inequivalent surfaces. Consider the set of conformal 
transformations that map the upper half-plane into itself, such that the real axis 
is mapped into itself. In general, the points on the real axis that are mapped 
into each other transform under a subset of the conformal transformations, the 
projective or Mobius transformations: 


,_ aytb 
~ cyed 


for reala, b,c andd such that ad — bc = 1. This set of four parameters defines 
a real matrix with unit determinant: 


o (2.5.13) 
a? 25. 


In general, the group defined by the set of all real 2 x 2 matrices that has 
unit determinant is SL(2, R). Notice that this group of transformations can be 
generated by making successive transformations: 


(2.5.12) 


y 


(25.14) 


Thus, we wish the amplitude, including the contribution of the measure, to be 
projectively invariant. 

Let us make a projective transformation on the integrand to see how it 
transforms: 


[|a=2y"" =] ]G@ 2)" | J@ ez. (2.5.15) 
i<j k 


i<j 


We want our measure to cancel out the noninvariant term in the above expres- 
sion. Let us take our measure to be the number | and the integration region to 
be fixed by z; > <;,1. Then there is one last complication. We must still “fix 
the gauge” for projective transformation or else we will have overcounting. We 
must integrate once and only once over each projectively distinct configuration 
of the <; variables. If the external momentum flows into the upper half-plane at 
points given by <;, then we are allowed to fix three of these points at random. 
This corresponds to “gauge fixing” the projective invariance, which selects out 
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only projectively inequivalent parametrizations. Our final result is 


Ci 9% = Zi41) 7 
d 
! i a I] dzi, (2.5.16) 


where we have explicitly removed the contribution from the three fixed points 
Za,b.c: 


Vang = dz, Gey dee, — Zs) (Zp — Zc) (Ze — Za) (2.5.17) 


where z, »,- are three points that can be randomly chosen on the real axis. Notice 
that we have fixed three variables so that we only integrate over projectively 
inequivalent configurations. (If we integrated over these three variables, we 
would have overcounting of the integration region. We could be integrating 
over an infinite number of copies of the same thing.) 

The simplest choices for the three fixed points are 


Z1j — W, 
ay 
rae (2.5.18) 


In this configuration, our final result for the N-point amplitude becomes [14— 
19]: 


a= f Tae I] (Zz —2yr"™, (2.5.19) 


2<i<j<N 
where the region of integration is 
OO = 2 2 22 — 1 > Z3°+-2y-1 > Zn = 0. (2.5.20) 


This is our final result for the N-point amplitude, which we derived using 
only functional techniques. 
Let us summarize what we have done: 


(1) We have taken the string lengths to be zero for the external tachyons, so the 
world sheet of the N-point amplitude was a horizontal strip in the complex 
plane that has width 2. Momentum from the external tachyons flowed into 
this strip at designated points along the real axis z; (see Fig. 2.6). 

(2) By making a conformal transformation from the strip to the upper half 
complex plane, we explicitly solved for the Neumann function, which we 
calculated using a trick from electrostatics, the method of images. 

(3) Projective invariance fixed the measure of integration to be the number 1. 
We ordered the external points z; > z;+1. 

(4) We “fixed the gauge” left over from projective transformations; i.e., we 
fixed three of the N points of the integrand. 


The previous expression for the N-point amplitude (2.5.19) considered only 
the case with external tachyons entering the world sheet at selected points on 
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the boundary. In principle, we can have particles with arbitrary spin on the 
boundary. For higher spins, we have the same factor ek * | which represents 
part of the Fourier transform, multiplied by the polarization tensor of the high 
spins. Thus, the vertex function for the tachyon is not actually e’“*, which 
is universal for all spins because it is part of the Fourier transform. The true 
vertex for the tachyon is actually just the number 1. 

Spin-2 vertices can be represented as 


V = /ge?8,X"d,X"e,,e%%™ , Q52i15 
where &,, is the polarization tensor. For the general case: 


V — af gg leita gtam-142m DG XI , i) Mele 2m on eee X# , (2.5.22) 


Qn 


2.6 From Path Integrals to Operators 


We used functional methods to calculate the N -point scattering amplitude. The 
only nontrivial part of the calculation was the integration over conformally in- 
equivalent two-dimensional complex surfaces, which determined the measure 
of integration of the amplitude. 

The same calculation can also be performed using the harmonic oscillator 
formalism, which, as we have stressed, is nothing but a particular represen- 
tation of the path integral for which the Hamiltonian is diagonal. For trees 
and the first loop, the harmonic oscillator method is quite simple because the 
Hamiltonian is diagonal on the Fock space of the harmonic oscillators. How- 
ever, for higher loops this is no longer true: the harmonic oscillator method 
becomes increasingly difficult and impractical. The path integral method thus 
provides the only systematic way in which to analyze the higher-loop ampli- 
tudes with relative ease. (The calculation of anomalies, however, is easier in 
the harmonic oscillator formalism, where the integer index 7 serves as a cut- 
off for the theory. In the path integral formalism, we must use point-splitting 
techniques and other regulators, as we shall see in Chapter 5.) 

In the functional formalism, we know that the propagator for free strings is 
given by 


al dre" IXs) = (Xe —— 1%). (2.6.1) 


Similarly, we also know that, by inserting a complete set of intermediate 
states within the path integral, the vertex function for the ith tachyon is 


(x esate i (2.6.2) 
where 


X' = X,(0 =0,t=1). (2.6.3) 


2.6 From Path Integrals to Operators i 


Because of the identity 


Ix) f Dx (xI=1 


we can remove all path integrals from the N-point function and make the tran- 
sition from the path integral formalism to the harmonic oscillator formalism. 
For example, let us begin with the path integral expression for the N-point 
tachyon amplitude and make the substitution to harmonic oscillators: 


| N 
An = f Pxaues Ter 


i=) 
— [ anes r 2 » ehki X(t) ix) f DX; (x, | cae 
ee [au {--- en x)) | Die aaa 
= il | [az (0, ky | elk2X(0) 6-11 piks X(0)... \0, ky) 


= (0, ki| V(k2)DV (k3)+-- V(kw_1) |0, kw) . 


Several points in this derivation must be clarified. First, in the path integral 
formalism, we found that we had to omit the i = j contribution to the inte- 
grand in (2.5.10). Similarly, we also have to truncate the harmonic oscillator 
formalism. The expression for the vertex function (2.6.2) is formally infinite 
when we convert to harmonic oscillators. For example, if we naively take the 
vacuum expectation value of the exponential, we arrive at a divergent sum. 
The normal ordered expression is the one we desire, which has finite matrix 
elements. Let us exponentiate (2.2.11): 


[o 2) 
(oe. |e 
aexpTKyex (6 — 0, 7) = exp f ; y ent gik-x(t) 
n 
| 


= Wn —int; 
«exp | ke Se } (2.6.4) 


(Normal ordering consists of moving all creation (destruction) operators to the 
left (right) so that the resulting operator has finite matrix elements.) 

Second, in the transition from path integrals to operators we used the fact 
that the Hamiltonian is the generator of t shifts. The expression Lo — | acts as 
the effective Hamiltonian for horizontal displacements in the complex plane. 
We will find the following formula useful: 


De = fay) (2.6.5) 
for any function f. Thus, we can define 
y*Vo(ki)y ° = Viki, =e"). (2.6.6) 
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We used this expression in the transition from path integrals to harmonic os- 
cillators when we converted the vertex function at t; to the vertex function at 
the origin. 

Third, in our derivation of the harmonic oscillator formalism we introduced 
the vacuum state |0, k). Notice that the functional formalism started with the 
world sheet of the interacting strings being an infinite horizontal strip of width 
m in the complex plane. The external lines were placed along the real axis. 
Therefore the effect of a string coming in from negative (positive) infinity cor- 
responds to the functional integral over a semi-infinite strip. Thus, the vacuum 
state |0) is the functional integral! of the string theory over a semi-infinite strip. 
We can represent the tachyon vacuum with momentum k as: 


10;k) = e'** [0;0) , (2.6.7) 
where 
a, |0;0) = 0, n> 0. (2.6.8) 


Putting everything together, we can now convert the path integral to harmonic 
oscillator form: 


N-1 
An = auf Dxe> I] dz; exp[ik;X“'(o = 0;1;)] 
ts 


= (03 k1| Vo(k2)D Vo(k3)D --- DVo(kw-1) |0; kw) (2.6.9) 


It is important to realize that the harmonic oscillator expression is a direct 
consequence of the functional formalism. 

Fortunately, this amplitude is easy to calculate. As an exercise, let us first 
calculate the four-point function explicitly: 


Ag = (0; ki| V(k2)DV(k3) |0; ka) 


| eg) oo 
= dxxk-! (9 eft Clie 
| Lee ( les | oe ; R exp k, ) ; |0} 


! 
/ dag rr = a 
0 


_ Peasy h(a) 
P(—a(s) — a(#)) 


| 


= B(—a(s), —a(t)), (2.6.10) 
where 
a(s)=1+4s, 
s=—(ki thy, 
t = —(ky +k3), 


“= —(ki +h, (2.6.11) 
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are the Mandelstam variables and R is the number operator: R = °°, na‘ay. 
Thus, the four-point scattering amplitude has an elegant form. In fact, it can 
be represented as the Euler Beta function. This Beta function has fascinating 
physical properties that first excited the imagination of high-energy theorists 
in the late 1960s. It was discovered by G. Veneziano and M. Suzuki quite by 
accident as they were looking for a way in which to satisfy finite energy sum 
rules for the hadronic S-matrix. At that time, theorists wanted to construct a 
scattering amplitude for hadrons that had the following of G. Chew’s criteria 
for an S-matrix: 


(1) Unitarity; 

(2) Lorentz symmetry; 
(3) CPT invariance; 
(4) Analyticity; 

(5) Crossing symmetry. 


To this list, theorists sometimes added: 
(6) Regge behavior, i.e., 
A(s, t) ~ 5% (2.6.12) 


for large s and fixed tr, and also 
(7) Duality, i.e., 


A(s,t)= > esl) => Os, | (2.6.13) 


2 2 


The list of “axioms” for the hadronic S-matrix was so large that physicists 
were pessimistic that they could ever be satisfied. Then, when the Euler Beta 
function was accidentally discovered while thumbing through a math book, 
these two young physicists were surprised that this analytic function satisfied 
all but one of these axioms! For example, Regge behavior can be shown by 
using Sterling’s approximation on the I function: 


5a 
T(x) ¥ V27—=e™. 2.6.14 
(x) © Vn ae (2.6.14) 
Duality can also be shown by evaluating the poles of the integrand near the 
limits of integration x = 0 and x = 1: 


1 @@) + D@) + 2)---@M +”) 


7 he (2.6.15) 


A(s, t) ~ 

A similar expression exists for the t-channel poles. In fact, all the postulates 

but the first, unitarity, can be satisfied. (Unitarity is violated for the simple 

reason that the analytic structure of the amplitude is a series of poles in the s- 

and t-plane. A true unitary amplitude, however, should also have imaginary 

parts instead of poles and cuts along the real axis, as we shall see in Chapter 5). 
Using operator methods, we can now solve for the N-point function. 
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Ba, =| 23 24 2N—1 


Zz, =00 Zy=0 
FIGURE 2.7. The N-point function. Projective invariance allows us to fix three of 
the N variables. The most convenient parametrization is to take the first variable to 
be 00, the second to be 1, the last to be 0, and all the others to be ordered between | 
and 0. 
The N-point amplitude can be written as (see Fig. 2.7): 


Ay = (0; ky | V(k2)D-- -V(ky-1) |03 kn) 
Tl ld 

-/ 1 % (0: ki VeVi Vile) eel | Re 

(2.6.16) 


where 


Nine a vero Xi. (2.6.17) 


(Notice that we commuted the factors of x* to the right, until they annihilated on 
the vacuum.) This expression can most easily be simplified using the coherent 
state formalism. Let an arbitrary state of the Fock space be represented by 


|A) = pas —(a'y \0) =e" |0). (2.6.18) 
Then we have 
(eed 
x7@1)) = |xA), (2.6.19) 


eM" A) = |A+u). 


By using these identities in succession, we find 


V(kji. Yi) , 
(0; ki i — sk) = []or— ee. (2.6.20) 


—2 i<j 
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Thus, we find the same expression for the N-point function (2.5.19) that we 
derived using functional methods: 


_ pN=l 
ae / Tl]: [T] oy)", (2.6.21) 
i—3 


2<i<j<N 


where the y’s are ordered along the real axis as before. Notice that we have 
already fixed the values of three points along the real axis to be 0, 1, and ov, 
as expected. 

Using oscillator methods, we can show that the N-point function is cycli- 
cally symmetric, which is not obvious when we write the amplitude as the 
sequence VDV DV DV DV. We use the following identity, which expresses 
what happens when two vertices are pushed past each other: 


[V (ki, vi)V (ko, yo) = V(ko, y2)V (ki, yi) exp[2e'wik; -kje(y1 — y2)I, 
(2.6.22) 
where e(x) = 1 if x > O and e(x) = —1 ifx < 0. 
We also have to rewrite the vacuum |0;ky) in a form where it resembles 
the other vertices, so we can calculate the properties of the amplitude under a 
cyclic rearrangement of the vertices. We will use the following identities: 


V(k, y) 
y 


lim (0;0) = |0;k) , 
y=0 


lim (0; 0| yV(k, y) = (k3 0]. (2.6.23) 


Written in this fashion, the tachyon states at the extreme right and left are no 
longer treated unsymmetrically from the others. The vertex on the right is now 
defined at 0, and the vertex on the left is defined at 00, and all other vertices 
occur within that interval. 

Thus, when we push the Nth vertex completely around the amplitude, we 
get 


(V(ki, 1) °° V(kN, yn)) 
= (V(ky, yw) V(Ki, v1): +: Vkn—1, Yn-1)) 


N-1 
: x exp atin Da us| (2.6.24) 


i 


Notice that the last factor vanishes if we have conservation of momentum: 
N 
a k; = 0, (2.6.25) 
i=1 


where we have also used the on-shell condition a’k;, = 1. We see that only on- 
shell do we have an amplitude that is cyclically symmetric. (Historically, this 
was another reason why physicists believed that a field theory of strings was 
probably not possible. A field theory, by definition, is an off-shell formulation, 
while the marvelous features of the Veneziano model worked only on-shell.) 
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2.7  Projective Invariance and Twists 
In addition to cyclic symmetry, we saw that the integrand of the N-point 


function was also Mdébius invariant; i.e., the function remains the same if we 
make the following change of variables: 


pa (2.7.1) 
cy+d 
where 
ad—bce=1. Qa 


We showed Mobius invariance in the functional formalism. Now let us make 
a Mobius transformation directly on the harmonic oscillator matrix elements: 


V(ki, y1) Vi(kn, yn) 


(ORO) << 
yi YN 
Viki. y;)  Vekw. y; = oa 
= (0; 0] vate ae 10: 0) [ |‘ Digs? (2) 
yy /N i=] 


AS we saw earlier, this factor, in turn, can be canceled by the transformation 
in the measure 


du —> du | [@ — cy). (2.7.4) 


It turns out that we can actually calculate the generator of these infinitesimal 
transformations on the vertex function. Not surprisingly, the generator of these 
Mobius transformations will be the Virasoro generators. Let us define 


e= iy Pepa”. (75) 


Then 


n 


TV (k;, zt a = ( = ea'k? Erne") V (ki, zi), 


[Ln, V(ki, z)) = 2" Ee + na't?| V(k;, 2;). 2.7.6) 
We say that the vertex V has conformal weight equal to ak. (The conformal 
weight will play an important part in the proof that the string model is ghost- 
free. In Chapter 4, we will explain the origin of the conformal weight, which 
labels irreducible representations of the conformal group generated by the 
Virasoro L,,. For a vertex function to be properly defined, it must have weight 
1 on-shell, which is true for the tachyon.) 
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Clearly, the L,, generate conformal transformations when acting on the ver- 
tex functions. In fact, we can show that a specific representation of the Virasoro 
algebra on conformal fields is given by 


L, = —z"*'8.. 


This representation satisfies the definition of the conformal algebra in (2.2.27) 
(minus the central term) and generates conformal transformations in functions 
of the complex variable z. 

Weare interested in the subgroup of the conformal transformations that maps 
the upper half-plane into itself and the real axis into itself, i.e., the projective 
subgroup. This subgroup is generated by only L,, L_;, Lo, which generates 
SLi. Ry 


[Z, Lo] = Dts 
SL(2, R): [Lo1, bol = —Laqy (257-7) 
[Pst 2 bg: 


We can easily calculate the transformation of a vertex function induced by 
these generators: 
ew V(y)e"! = V[y(1 — ay)", 
eo V(yje = V[e"y], (2.7.8) 
Bev je = Vy cl. 
If U is an element of SL(2, R), then we also have 
U|0-0) 10; 0) . (2.7.9) 


(At first this might seem surprising , because we know that real states are 
annihilated by L,, for positive, not negative, integers n. However, |0; 0) is not 
a real state. It is annihilated by L_; because 


L_; |0;0) ~ ao -a@_; |0;0) = 0 (2.7-10) 


because ay |0;0) = 0. Thus, |0;0) corresponds to the true vacuum of the 
SL(2, R) group, which is not a real state of the theory. When we multiply |0; 0) 
by e’**, then it becomes a real state (because x has commutation relations with 
ao) and is annihilated by L,, for positive n). 

Putting everything together, we have 


V(ki,y1) Vik, yn) 


(0;0|U-'U += U'U (0; 0) 
yI YN 
Viki, y} Vik, y% Z 
= (0;0| ay) Rx Viki Yn) 10: 0) []@ oye ae el) 
yj YN at 
which reproduces (2.7.3). 


Besides the Virasoro gauge group, one other feature of string theory is totally 
missing in the point particle case, and that is the “twist” operator. Remember 
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that a string sweeps out a two-dimensional world sheet, not just a single line. 
Thus, if we were to twist the world sheet, a topologically inequivalent world 
sheet would be swept out. At the first loop level, for example, this is the crucial 
topological difference between a disk with a hole and a Mobius strip. 

A simple expression for the twist operator 2 [20] can be derived from the 
observation that a vertex function located on the real axis should convert into 
a vertex function located on the top of the strip: 


QV(o = 1)Q7|! = Vio = 0). ogi. 


Notice that the only change in the vertex function under this transformation 
is that every oscillator at the nth level is multiplied by (— 1)". Thus, the twist 
operator must be 


G41) = (2.7.13) 
where 
oO 
Ne oenc (2.7.14) 
ri i| 
Notice that this satisfies 
Q=1 ene 


as it should. This establishes the twist operator up to an overall sign. However, 
since N even states are charge conjugation even and N odd states are odd under 
C, this fixes the value of Q that we have chosen in the expression. 

There is an equivalent method of obtaining the form of the twist operator. 
Notice that the action of the twist operator on a tree is to reverse the orientation 
of the external lines (see Fig. 2.8) 


82Vo(k1)DVo(k2) +++ Vo(Ky—1) (Okay) = Vo(kw DVo(Ky -1)- >: Volks) Oc ky). 
(2.7.16) 
Notice that the cyclic ordering of the tree has been reversed by the twist op- 
erator. To extract the operator that will perform this reversal, let us first write 


ieee N-1 N N N—1 2 


) 
i 
" 


ileez N-1 1 


FIGURE 2.8. Action of the twist operator. The twist operator twists the factorized leg, 
which ts equivalent to flipping the entire diagram upside-down. By duality, however, 
we can always rewrite the diagram in the original configuration (with external legs 
renumbered). 
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the expression in terms of y variables: 


N-1 
Q / [ [ay Volks. 91) «++ Volk, yw) 1050), (2.7.17) 
= 


where we take the limit as y; > 1 and yy > 0. 
When written in this fashion, we now see that we can reverse the cyclic 
ordering of the amplitude and interchange y, and yy by a change of variables: 


Y= 45 


Now we wish to write down an operator that will perform this change of 
variables. Previously, we wrote down the generators of SL(2, R) that will 
perform a projective transformation on a vertex function. By examining this 
change of variables, we easily find that the twist operator must be given by 


Q = (-1)eE, (2.7.18) 


Although the two forms of the twist operator seem totally different, they are 
actually identical on-shell. Since the Veneziano amplitude is strictly on-shell, 
we have the freedom of choosing either form of the twist operator. 


2.8 Closed Strings 


So far, our discussion has been specific to open strings, where the external 
tachyons attach themselves to the endpoints of the conformal strip swept out 
by the string. Poles emerge in the Veneziano model when two points z; and 
z; along the edge of the strip come close together. Let us now analyze the 
Shapiro—Virasoro model, which is based on closed rather than open strings 
and corresponds to a path integral taken on the tube (or sphere) swept out by a 
closed string. The pole structure of this model is much larger than the original 
Veneziano function, because external states can attach themselves anywhere 
along the surface of the tube as it moves in spacetime [21]: 
I(—3a(s))P(—ja(t))P'(— 5a(u)) 

PI-3eO + ow) 3OW + aNII-2EO) +a) 
This function, unlike the earlier Veneziano function, has poles simultane- 
ously in all three channels, rather than just two channels. This was quickly 
generalized to the N-point function: 


AUS. le) 


Ay = | dp TI ta —zl, (2.8.2) 
Q<i<j<N 
where 
lil d*z; 
dt = |Za — nl" |2p — 2el"|Ze — Zal’ (2.8.3) 


Dp: By) Bd 
Ged aya = 
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As in the case of the Veneziano function, the poles in this amplitude emerge 
when two variables z; and z; come close together, except now the poles can 
occur anywhere in the complex plane, not just on the real axis. 

Our starting point for the quantization of the closed string is (2.2.21), where 
we decompose the string X,,(0) and its conjugate into normal modes. The 
canonical commutation relations remain the same as (2.2.7), which leads to the 
Hamiltonian given in (2.2.22). As in the open string case, we can factorize the 
amplitude into vertices and propagators in the harmonic oscillator formalism, 
but now several important differences emerge: 


(1) We now have two sets of mutually commuting harmonic oscillators a, and 
a, to sum over, rather than just one set as in the open string case. 

(2) The Virasoro conditions now consist of two sets of conformal generators 
L,, and L,, acting on the physical states: 


Li |) = Ln |) = 0, 
(Lo — 1) |b) = (Lo — 1)|¢) = 0. (2.8.4) 


(3) We must integrate over all shifts in o, because the closed string states 
should be independent of where we chose the origin of the o coordinate. 

(4) The amplitude is not just the sequential product of vertices and propagators. 
Because external lines can occur anywhere in the complex plane, we must 
sum over all different orderings of the external lines. 

(5) Fixing the vertex function to have weight | and using the anomaly cancel- 
lation argument show that the intercept for the closed string model must 
be 2 and a’k* = 2, meaning that the theory necessarily contains the mass- 
less graviton. In fact, the linearized general covariance gauge symmetry 
of general relativity simply emerges as the lowest-order Virasoro gauge 
symmetry (we will expand on this in Chapter 7). 


Let us begin by discussing the propagator: 


ae I 7lo—25Lo-2 222 
some eer & 4 az 
21 |z|<1 
sin z(Lo — lay i 


(2.8.5) 


RiLy Lane iae 


This actually has a simple physical interpretation. Notice that the factor of 
the inverse L’s simply is the usual propagator of a closed string. The function 
involving the sines, however, is equal to zero unless 


Lo = Lo. (2.8.6) 


This, in turn, can be represented as 


‘ do exp[i2mo(Lo — Lo)). (2.8.7) 


2.8 Closed Strings 89 


FIGURE 2.9. Conformal surfaces for closed string propagation. In the p-plane, the 
string propagates on a horizontal strip of width 27r, such that the upper and lower 
horizontal lines are identified with each other, which is topologically equivalent to 
a tube. In contrast to the open string case, the external lines attach themselves to 
the interior of the surface, not the boundary. In the z-plane, this surface maps to the 
entire complex plane by the exponential map. 


There are two ways to interpret this operator. If we explicitly perform the 
integral, we have the operator 6(Lo — Lo), which is a projection operator 
acting on the full Hilbert space that eliminates states |6) which do not satisfy 
(Lo — Lo)|¢) = 0. Second, we notice that this operator is the generator of 
o rotattons by one full cycle, so that the propagator simply expresses the fact 
that, as aclosed string moves, we must integrate over one full cycle. The closed 
string amplitude is therefore independent of where we chose the origin of the 
o parametrization. (This constraint will have important consequences when 
we discuss the compactification of the closed string and the heterotic string in 
Jater chapters.) 

In Fig. 2.9, we see that the world sheet originally consists of a horizontal 
strip in the complex plane that is 27 in width, such that the top and bottom 
sides are identified with each other (creating a long horizontal tube). External 
tachyon lines can enter this tube internally. We see that by exponentiating these 
coordinates, we can transform this horizontal tube into the entire complex 

plane. External lines then couple to all points within the plane. 
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As before, the vertex function is once again given by :e’**:, where the string 
X is defined at the point in the complex plane where the external momentum 
is entering. Because the two sets of harmonic oscillators commute, the vertex 
function factorizes into the product of two open string vertex functions. The 
final expression for the N-point scattering amplitude of tachyons is given by 
(2 2ee23)|: 


Aw = ) | (0; ki] V(k2)D --- V(kw-1) 103 kw). (2.8.8) 


perm 


The summation over all possible permutations of the ordering of the external 
lines guarantees that the z, variables can roam freely in the complex plane. 


2.9 Ghost Elimination 


We have developed the harmonic oscillator formalism using Gupta—Bleuler 
quantization. In the conformal gauge, the theory maintains manifest Lorentz 
invariance and, in fact, becomes a theory based on free fields. This accounts 
for the fact that the theory at the tree level is very easy to write down. 

The price we have to pay for this simplicity, however, is that we must impose 
the Virasoro constraints directly onto the Hilbert space to eliminate ghosts. 

In general, the proof that ghost states do not couple in tree diagrams is quite 
simple. (The proof breaks down in loops, where extreme care must be exercised 
to properly eliminate ghost states.) Let us define a real physical state as one 
that satisfies the Gupta—Bleuler constraints: 


Lb, eS: n> 0, 


[Lo — 1] |R) = 0. (2:91) 
Let us define a spurious state as one that does not couple to real states: 
(o[ Ry = 0. @:9.2) 
We can conveniently represent such a state as 
|S) = L_n Ix) (2.9.3) 


for some state x. We now wish to show that spurious states do not couple to 
(reesaises 


(| Tree) =O); (2.9.4) 
To show this, we need two more identities: 
[Ln — Lo —n + 1]Vo = Vo[L, — Lo + 1], 


l 
a 
: 1 lagen | 


(This can easily be proved from (2.7.6), which in turn crucially depends on the 
vertex function having conformal weight | with external tachyons satisfying 


= Dis 5) 
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a’k* = 1. Thus, the restriction on the conformal weight of the vertex is crucial 
for the elimination of all ghosts in the theory.) 
From these two identities, we can now easily show 


a eee Va D Vy Dik: Va l0\ = 0, (2.9.6) 


This is the direct result; it shows that the L’s can be pushed to the right until 
they finally annihilate on the vacuum. 
In summary, we have shown that spurious states do not couple to trees: 


L,|Tree) =0 —- (S|Tree) = 0. (2.9.7) 


This means that we do not have to make any special change in the functional for 
tree amplitudes because of the presence of ghosts. The ghosts states propagat- 
ing within the tree amplitude automatically cancel by themselves. However, we 
find that the loop functions do indeed have problems with ghosts propagating 
internally. This is because 


(S| Tree |S) 4 0. (2.9.8) 


When we naively trace over the trees to obtain loops, we thus inadvertently 
include the presence of ghosts, which must be factored out explicitly: 


Atcop = )_, (n| Tree |n) . (2.9.9) 
The sum over |n) explicitly contains ghost states. 

The fact that ghosts decouple from trees and only contribute to loop diagrams 
is precisely equivalent to what happens in the Yang—Mills case. Faddeev—Popov 
ghost contributions to the Yang—Mills theory do not contribute to the trees of 
the theory, but they do contribute to the loops. The Faddeev—Popov ghost 
contribution is (see (1.9.30)): 


cd, D* (A)c. (2.9.10) 


Notice that this produces the following interaction of the gauge field A with 
the ghost field c: 


Lip SUC AC: (29711) 


This coupling means that a single ghost cannot couple to a tree diagram con- 
sisting of gauge fields. They can only contribute to loops, where the c fields 
can circulate internally. Thus, particular attention was required to guarantee 
that the Faddeev—Popov ghosts canceled against the negative metric states in 
the loops. 

The proof that the Virasoro conditions completely eliminate all possible 
ghost states is, however, an extremely complicated task. In principle, the ability 
to go to the light cone gauge is normally sufficient to prove that the theory is 
ghost-free. However, we can never be sure that the quantum theory doesn't 
introduce anomalies that destroy this fact. Therefore, it becomes important 

to check explicitly that the Fock space is ghost-free in the Gupta—Bleuler 


o2 2. Nambu—Goto Strings 


formalism. There are two independent proofs of this theorem, neither of which 
is very simple, due to Brower, Thorn, and Goddard [24, 25]. The reader may 
skip the following discussion. 

Because of the numerous details involved, let us first sketch our strategy for 
the elimination of ghosts. We would like to construct a set of physical operators 
Vi and V> such that: 


(1) they commute with the Virasoro generators: 


[Ln Vp] = 0, 
[Ln V,] = 9; (2.9.12) 
(2) the V; and Vz generate the physical Hilbert space: 
DRY) = Voy Ving °° Voy V cp” V See (2.9.13) 


such that the states have either positive or zero norm, but never negative 
norm. 


The original Fock space of D-dimensional harmonic oscillators should be 
equivalent to the set of states generated by these (D—2)-dimensional transverse 
operators, the — component of these oscillators, and the original Virasoro 
operators: 

[v. 


—n? 


ees en (2.9.14) 


The L_, generate ghost states, so that by only taking the states V', and V_, 
we will generate the correct ghost-free states. This is the desired ghost-killing 
result that we now want to prove. 


Let us begin by first defining 


| <r 
A,=> V' (nko, t) dt, (2.9.15) 
2n 0 
where 
V' (nko, T) = Xb elthouX"(e) (2.9.16) 
where kp is a null vector: 
Ue == (t} 
ky =-1, (2.9.17) 
en. 


We can view this vertex operator as the vertex for the insertion of a massless 
vector particle. This particular vertex function was chosen for the following 
reason. Notice that, because Ao is a null vector, the Virasoro generators have 
the following commutation relations with this vertex operator: 


; d Imt 
[Lm, V(r)] = -i—(e"™"V(z)). (2.9.18) 
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Notice that this is a total derivative. When integrated over a circle, this means 
immediately that 


[Lm, Ai] =0. (2.9.19) 


Thus, we chose the form of the vertex function such that we fulfill the first 
criterion: the A’s commute with the Virasoro generators. So far, however, we 
have no constraints on kp other than that it is a null vector. We will need to 
place more constraints so that we fulfill the second condition. 

Let us start with a ground state vector of momentum po, such that po and ko 
satisfy 


a’ p2 = 1, 
Rp, = 1. (2.9.20) 


Now let us apply the vertex operator A’, on the state vector |0; po). The resulting 
state has momentum pp + nko. We want this resulting state to satisfy the mass 
shell condition as well. As a consequence, we demand 


cM? = —} (po + nk =—-1—n. (2.9.21) 


Thus, we demand that the momentum vector kp be constrained to be a null 
state so that the vertex operator commutes with the Virasoro generators, and 
we demand that ko - po = | in order to generate additional transverse mass 
shell states when acting on the ground state. 

Now, we must fully implement the last condition, namely that these op- 
erators, in fact, generate the entire physical space of states. There are some 
problems here. At first, we might suspect that the — modes can be created by 
simply covariantizing the value of ky. Normally, since the conformal spin of a 
product of operators is just their sum, we expect this to be true. However, the 
normal ordering destroys this. In fact, we find that the — components do not 
commute with the Virasoro generators: 


[Lee eae" (-i- +m) Kee * : tlm emme” (210,02) 


In order to remedy this difficulty, we must add an extra item to the definition. 
The complete definition is now 


F d sie 
V4 (k, 7) = 1X" kX + sik" (log k jee, (2.9.23) 
a 16 


It can be shown that the above combination has conformal spin | and reduces, 
when we take the transverse components, to the previous expression when we 


take its integral. 
Next, let us take the commutator between these fields: 


[V4, V2] = nke Ve, — keV, + Ceo ns (2.9.24) 


m+n 
where 
CH = Imi ki ky + mn””. (2.9.25) 


m 
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This allows us to write 


ve ae On0s 
[Vi, Vi] = mbij5n,—n: (2.9.26) 
[ ge —NV i ans 


[ Ven Sti nV, + 2m? Omen 


Notice that the + components are trivial and that the commutation relations 
of the transverse operators are exactly the same as for the usual harmonic 
oscillators. Now let us redefine the — modes once again: 


D—-2 


Vi=V,- See (2.9.27) 


Putting everything together, the final commutation relations are 


[Vinw Vi] = m8" bn —ns 
[V., ViJ=0, 


[V5,V-J=(m-a)v, i Oe (2.9.28) 


morn 


[oe oF 


This is the final set of commutation relations. Notice that new V~ , Operators 
commute with the original V/, operators and that they both commute with the 
Virasoro generators. Thus, we have now created a new Hilbert space of linearly 
independent operators that we can use to replace the original Fock space: 


fa") > (Vie eee (2.9.29) 


The physical Hilbert space is thus generated by the operators {V',.\V~,}. 
which are equivalent to the DDF operators [26] which generate the physical 
space in the light cone gauge. Furthermore, we know that the following state 
has zero norm: 

VeVi Vv ae0). (29330) 


—"N 


This can be checked explicitly by taking the norm of the state and using the 
commutation relations of the V,,. Thus, we have the final statement, that the 
set of states generated by 


{viVv:,} (2.9.31) 


have either positive norm or zero norm, but never negative norm. This com- 
pletes the proof that the Virasoro conditions completely remove the ghosts 
from the Hilbert space. 
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2.10 Summary 


The first quantized theory of strings is remarkably similar to the theory of point 
particles, except for the nontrivial addition of gauge symmetries representing 
the reparametrization invariance of the world sheet. As in point particle theory 
(see (1.4.16), we have three equivalent actions for the string: 


2 
first-order (Hamiltonian) form: L = P,,X" +ma'd| P? + = 
 a(2mo)e 
Te ae eee ote mice 
a POn Xa 2) eo, 


—l 
second-order form: L = Trg VBR" aX appx pment LOL) 
Tt at! 
: l : : 
nonlinear form: L = ——(X2 xX? —(X,X#)*)!?, 
Da ee 
Let us summarize the similarities found in the point particle case and the 
string theory when expressed in path integral language: 
He) One). 
length — area, 
Dx = [| | 4x,(t) > DX = [| 4X,.@, 7), 
LT 


[L,T,0 


xj xe 
| j Dxei | Lix)dt aa i! ; DXei f L(X)dodt 
eG; xX, 


N . N . 
ieles= ay tes 
t—! =| 


graph — manifold, 


! ¥) 
ptm? =0-+ (7, +) —0 


Our strategy for quantizing the string action is to write down the symmetry 
of the theory, extract the currents from this symmetry, calculate the algebra 
satisfied by these currents, and apply these currents onto the Hilbert space in 
order to eliminate ghosts. The strategy we follow throughout the book is 


Action — Symmetry — Current > Algebra — Constraints — Unitarity. 


All three actions possess reparametrization symmetry, which generates the 
Virasoro algebra: 


re 2 
i al daei”” ES : (2.10.2) 


96 2. Nambu—Goto Strings 
The algebra generated by these operators is 
D 
[Ln Lm] = (2 — m)Lntm + a — N)bn,—-m: (2.10.3) 


As in the point particle case, there are three ways in which to quantize the 
theory. 


Gupta—Bleuler Quantization 


In the Gupta—Bleuler formalism, we fix the gauge: 
Sab = Sab (2.10.4) 


and the resulting action breaks reparametrization invariance but maintains the 
subgroup of conformal transformations: 


J v2 12 
L = (XG, ~ Xp). (2.10.5) 


This Lagrangian, of course, propagates negative metric ghosts associated with 
the timelike mode of X. To eliminate them, the Gupta—Bleuler quantization 
method states that the state vectors of the theory must satisfy 


Ln |) = 0, n> 0, 
(Lo — 1) |¢) = 0. 2.10.6) 


This means that the constraints are constructed to vanish on the Hilbert space. 

Although the action in this formalism is quite elegant. the price we pay is 
that the elimination of ghosts on state vectors is quite difficult, especially for 
loops. In fact, a large mathematical apparatus is necessarily to show that all 
ghosts can be eliminated in this gauge. 


Light Cone Quantization 
In the light cone quantization program, we set 
Xe pet. (2.10.7) 


The advantage of this approach is that we can eliminate all the redundant 
ghost modes of the theory from the very start in terms of only the traverse 
components. The disadvantage is that the formalism is awkward and we must 
reestablish Lorentz symmetry at every step. The surprising feature is that 
the Lorentz generators of the broken theory close only in 26 dimensions. 
Specifically, the problem occurs with the commutator: 


; Se, an: 
[M '.M/)=— ) [al a/ —a!,a/JA,, (2.10.8) 
n= 
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where 


D— 26 
12 


n 1 
A, = —(26—D = = ; aoe 
1! +2] +2 2a| (2.10.9) 


For this commutator to vanish, we must have 


D=2%6, a=1. (2.10.10) 


BRST Quantization 


The BRST method combines the best features of both approaches. We still 
maintain Lorentz covariance of the action, but the negative metric states 
do not worry us because they cancel against the ghosts circulating in the 
theory because of the Faddeev—Popov determinant. When we exponentiate 
the Faddeev—Popov determinant, we must introduce two anticommuting ghost 
fields b and c. The resulting gauge-fixed action still has a residual symmetry 
generated by the BRST charge: 


lo) 1 CoO 
O = oly — 2) + > [e-nLn + L-nen] — 5 een a) 
n=1 n,m=—0o 
(2.10.11) 
Fixing 
g? =0 (2.10.12) 


sets the intercept to be one and the dimension of space-time to be 26. The 
physical states of the theory are defined by 


Q |phy) = 0. (2.10.13) 


As in point particle theory, interactions are introduced by summing over 
different topological configurations in the path integral. The fundamental func- 
tional equation for the interacting theory, upon which this entire chapter rests, 
is given by 


An(ki, k2,.-.,kn) = Sf aun f Deus f ox 


pay 
x So [ Lavar [vee x" 
=) 


Sf peo fae (T] JB explik;, ee ) 


(2.10.14) 


one 
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The amplitude can be computed exactly in the conformal gauge by using the 


identity 
l , 
[ oxew|-ae fax, aK Pr +i f JyX"a'e| 


= exp {s | Ju(z)G(z, z')F*(2') dz az 


= [Jeet (2.10.15) 


i<j 


In the conformal gauge, the sum over topologies is given by the sum over 
all conformally inequivalent configurations. If we consider conformal trans- 
formations that map the upper half-plane into itself and the real axis into 
itself, then the points along the real axis transform according to a projective 
transformation SL(2, R): 


,  aytb 
~~ eyt+d’ 


where the coefficients are real and satisfy ad — bc = 1. This fixes the measure 
dt, so the formula for the N-point function becomes 


av= f Tae |] rea oene (2.10.17) 


=3 2<i<j<N 


(2.10.16) 


where the z’s are ordered along the real axis. 

The transition to the harmonic oscillator formalism is straightforward be- 
cause, in the conformal gauge, the Hamiltonian is diagonal on the Fock space 
of oscillator modes. The propagator from the configuration X,, to X;, is 


] 
(xa fo ede |Xp) = (Kel -—— IX) (2.10.18) 
The vertex is equal to 


(X..| eikinX* (0.7) |X.) : (2.10.19) 


We can remove all the string eigenstates because 
Ix) f DX (xi = iF (2.10.20) 
Thus, the N-point function is equal to 


An = (0;k;| Vo(k2)DVo(k3)-+- Vo(kKn—1) 10; kw) . (2.10.21) 


In the operator formalism, projective invariance can be reestablished by 
using the fact that the operators Ly; and Lo generate the projective group 
SL(2, R). In fact, under an arbitrary conformal transformation, the vertices 
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transform as 


TV(ki,z%)T = ( = ea'k? Yn ViGrz,). (022) 
n=! 


or 
n d 14,2 
[La V(ki, z;)] =z eG, talk? | Viki zi) (2.10.23) 


We say that the vertex V has conformal weight equal to a’k? = 1. 

We can explicitly construct the operators that will generate only physical 
states with zero or positive norm. The previous identity shows that operators 
with conformal spin equal to | automatically commute with the Virasoro gen- 
erators. This allows us to create operators, based on the vertex for a massless 
spin-1! particle, that will generate the physical space. We can construct three 
sets of mutually commuting operators which together generate the entire Fock 
space of harmonic oscillators: 

VV 


Fie Sle 


eel): (2.10.24) 


By taking only those states built out of V!,, and Ven and dropping the L_,, 
we obtain a new Fock space that has only states which have positive and zero 
norm. Thus, the set of spaces that satisfy: 


Ln |p) = 0, 
(Lo — 1) 1) = 0, (2.10.25) 


is free of negative norm states in 26 dimensions. 

Finally, we note that string theory is easy to quantize because its Hamiltonian 
is quadratic in the string variables. Simple harmonic oscillators can be used 
to quantize the theory. However, in M-theory, this simplicity breaks down, 
because the membrane 1s quartic in the Hamiltonian. At present, there is no 
satisfactory way in which to quantize the free mambrane. 
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CAPTER™ 3 


Superstrings 


3.1 Supersymmetric Point Particles 


Supersymmetry is one of the most elegant of all symmetries, uniting bosons 
and fermions into a single multiplet: 


Fermions < Bosons. 


By uniting fields of differing statistics, supersymmetry and supergroups have 
also opened up an entirely new area of mathematics. 

However, the irony is that there is not a single shred of experimental evidence 
in its favor. For example, physicists have tried to fit the electron or neutrino into 
supersymmetric multiplets, but the scalar partners of these leptons have never 
been seen. In fact, none of the presently known particles has a supersymmetric 
partner. Some critics have called supersymmetry a “‘solution looking for a 
problem.” 

Although there are absolutely no empirical data to support the notion of su- 
persymmetry, it is undeniable that supersymmetry provides a wealth of highly 
desirable theoretical mechanisms that hold tremendous promise. Supersym- 
metry is more than just an elegant way in which to unite elementary particles 
into aesthetically pleasing multiplets; it also has definite practical applications 
to quantum field theory: 


(1) Supersymmetry generates super-Ward—Takahashi identities that cancel 
many normally divergent Feynman graphs. For example, Feynman loop 
diagrams with bosons and fermions circulating internally in the loop differ 
by a factor of —1. Because of supersymmetry, the boson loop can cancel 
against the fermion loop, leaving us with a much milder divergence. We 
see, therefore, that Yang—Mills theories with supersymmetry have better 
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(2) 


(3) 


(4) 


(5) 


3. Superstrings 


renormalization properties than ordinary gauge theories. In fact, certain 
“nonrenormalization theorems” can be proved to all orders in perturbation 
theory. 

Supersymmetry may solve the “hierarchy problem” that plagues ordinary 
Grand Unified-type theories. In GUTs, there are two widely separated en- 
ergy scales, the energy scale of ordinary particle physics in the billion 
electron volt range and also the GUT energy scale at 10!° or so billion 
electron volts. In between these two energy scales is a vast “desert” where 
no new phenomena are found. However, when renormalization effects are 
calculated, the two energy scales inevitably begin to mix. Loop correct- 
ings, for example, to the quark masses can push them up near the GUT 
scale, which is unacceptable. “Fine-tuning” one’s coupling constants and 
masses by hand can, in principle, solve the hierarchy problem, but this 
is very contrived and artificial. Fortunately, the Ward—Takahashi identi- 
ties of supersymmetry are strong enough to enforce “nonrenormalization 
theorems” to all orders in perturbation theory. Thus, supersymmetry 1s 
necessary to stabilize these two mass scales in perturbation theory and 
prevent mixing. 

Supersymmetry may help shed light on the “cosmological constant” prob- 
lem. Empirically, the cosmological constant term A./—g. which is a 
correction to the Einstein—Hilbert action, is exceedingly small on an as- 
tronomical scale. The problem is to explain the near vanishing of the 
cosmological constant without “fine-tuning.” Supersymmetry is probably 
strong enough to force the cosmological constant to be zero to all orders 
in perturbation theory (because this term breaks supersymmetry). This 
doesn’t completely solve the cosmological constant problem. however. be- 
cause we must inevitably break supersymmetry to reach ordinary energies. 
(The problem, therefore, is to explain the vanishing of the cosmological 
constant after sypersymmetry breaking has occurred.) 

Supersymmetry eliminates many undesirable particles. The tachvon that 
appears in the bosonic string model, for example, is eliminated because 
it violates supersymmetry. By eliminating these particles. supersymmetry 
also reduces the divergence of the higher loop graphs. In Chapter 5 we will 
show that the potential divergences of the superstring theory are associated 
with the infrared emission of tachyons and dilatons. Thus. by eliminat- 
ing these particles, we simultaneously eliminate the souree of potential 
divergences. 

Lastly, when supersymmetry is clevated into a local gauge theory. it natu- 
rally reduces the divergences of quantum gravity. This is because local 
supersymmetry can be defined only in the presence of gravitons (see 
Appendix). Local supersymmetry is thus intimately tied up with general 
relativity. In fact, local supersymmetry successfully eliminates the lower 
loop divergences of supergravity. However, the largest of the supergravity 
theories, O(8) supergravity, probably has a divergence at the seventh loop 
level, which most likely rules out supergravity as an acceptable quantum 
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field theory. Only when we combine local supersymmetry with the con- 
formal invariance of the string theory do we have a large enough gauge 
group to eliminate perhaps all the divergences of quantum gravity. 


Supersymmetry, as an invariance of an action, was first discovered in the 
string theory. Gervais and Sakita [1] showed that an extension of the usual 
bosonic action possessed a symmetry that converted bosons into fermions. 
Unfortunately, it languished for many years because the supersymmetry of 
the early string model was a two-dimensional supersymmetry on the world 
sheet. It wasn’t until relatively recently that it was finally proved that the 
string model possessed both two-dimensional and 10-dimensional space-time 
supersymmetry. 

Let us begin our discussion by considering the simplest possible spinning 
action, the spinning point particle. In addition to the variable x,,, which locates 
the position of the point particle, let us introduce Dirac spinors 0“, where A is 
arbitrary, and the Dirac matrix in D dimensions I“. In general, a Dirac spinor 


in D dimensions has 2''/?)” complex components. We can write [2, 3] 
1 in : 
Sa 5 fin - 101,64 ar. (3.1.1) 
This point particle action is invariant under 
604 = «4, 
ee eee 
; Bele) 
dx4 = ie4T04, ( 
oe = 0) 
Notice that the combination 
Tt = x4 — ig4r+e4 (3.1.3) 


is an invariant all by itself under this transformation. Thus, any expression 
involving this combination will be invariant under this symmetry. The strange 
feature of this action, however, is that half of the components of the fermionic 
field cancel from the action by itself. 

By varying the e, x,,,and 6 fields, we can derive several equations of motion: 


Il’ =0, 
TI” = 0, 
r-mé=—0. (3.1.4) 
We also have 
(rr = -I’? =0. (3.1.5) 


Thus, half the eigenvalues of the matrix y -IT vanish. But since @ always appears 
in the combination 6“11,,1°“6%, half the components of the Dirac spinor are 
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actually missing from the action. Thus 6 is not an independent spinor but 


satisfies a constraint that reduces its components by half. 
The reason for this decoupling is that the action is invariant under yet another 


local symmetry [2]: 
664 = iT - Tk", 
dx" = id4T*#$04, (3.1.6) 
Se = 4e04K4, 


Furthermore, there is also another bosonic invariance of the action: 


604 — 104, 
bx" = i64T“504, (3.1.7) 
be = 0. 


When we try to commute two supersymmetry operations given in (3.1.6), we 
find that the algebra does not close unless we use the equations of motion: 


(81, 69]04 = (2iP eAO2P M4? +4iP - Me A0%x8)—(1 2). (3.1.8) 


(This is actually typical of supersymmetric actions. Notice that the numbers 
of components of x, and 6“ do not necessarily match off-shell, which means 
that auxiliary fields are in general necessary to close the algebra.) 

Lastly, if we calculate the canonical conjugate to the coordinates, we find 


) 
ty = re iT*11,04. (3.1.9) 


This poses vast complications for the covariant quantization. Because there is 
an explicit dependence on x within the canonical conjugate, the quantization 
relations become nonlinear and hence exceedingly difficult to solve. Worse, 
it turns out that the term I'“IT,, becomes a projection operator in the theory. 
meaning that we cannot invert the transformation and solve for 6“. In other 
words, a naive covariant quantization of the spinning point particle does not 
seem to exist at all! This is a warning that the supersymmetric string is not 
going to be as simple as super-Yang—Mills or supergravity theory. As a con- 
sequence, we will discuss the simpler two-dimensional world supersymmetry 
of the Neveu-Schwarz—Ramond (NS—R) model first, and then the more com- 
plicated 10-dimensional space-time supersymmetry of the Green—Schwarz 
model. 


3.2 Two-Dimensional Supersymmetry 


Given the problems associated with superparticles (such as lack of a covariant 
quantization and lack of an algebra that closes off-shell), let us temporarily 
abandon space-time supersymmetry and discuss the two-dimensional world 
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sheet symmetry of the simplest possible action involving free strings and free 
fermions. This action will already be gauge fixed in the conformal gauge, but 
it will reveal all the essential features of two-dimensional supersymmetry. In 
this gauge-fixed formulation, we will impose the gauge constraints on the Fock 
space by hand. Later, we will present the complete action, where we will be 
able to derive these constraints starting from a locally symmetric action. 

In the conformal gauge, we have [1] 


I _ 
L =~ (8,Xyd°X" — ib" p"da,), (3.2.1) 


where a = 1, 2 and labels two-dimensional vectors, and jw is a space-time 
index. 

Notice that w is a bizarre object, an anticommuting Majorana spinor in two 
dimensions and a vector in real space-time. Let us define 


yt = ( rh ) , pt pio’, 


wr 
0 QO —-i 
pale sale 
ee oa 
ae a (3.2.2) 


ae p”} = apy 
If we write this out explicitly in components, we have 


rae : 
a as ~X—X'- X' +i Wold, + Oo) Wo + (iWi(O, — O51). (3.2.3) 


Although this action is gauge fixed, it is still invariant under the global 
transformation: 


5X4" = Ey", 
Sy" = —ip4d, X“e. (3.2.4) 


Thus, by temporarily abandoning our attempt to produce a theory of strings 
with genuine space-time spinors, we have achieved a two-dimensional 
supersymmetric theory that is quite simple, involving free bosonic and 
anticommuting fields. 

Historically, the NS-R theory [4-6] was the first successful attempt at 
introducing spin into the dual model. It was also the first example of a lin- 
ear supersymmetric action [1] and was soon followed by four-dimensional 
supersymmetric point particle actions [7, 8]. 

Now that we have written down our two-dimensional supersymmetric ac- 
tion, let us retrace the steps we used in the previous chapter to solve the system. 
The next step is to extract the currents associated with these symmetries, then 
establish the algebra that these currents generate, and finally apply these con- 
straints on the Hilbert space. The sequence we will follow in this section is 
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a straightforward generalization of the steps we developed in the previous 
chapter: 
Action > Symmetries > Current > Algebra — Constraints > Unitarity. 
Gr) 
Following this strategy, let us now calculate the supersymmetric current as- 
sociated with this symmetry. As we saw in the last chapter, the existence of 
conformal symmetry is sufficient to generate a conserved current. In (1.9.8), 
we saw that this conserved current could be written as 
q OL 5 
u 80,6 5e% 


(3.2.6) 


By inserting (3.2.4) into (3.2.6), we find 
Ja = 50” pu" Xp. (3.2.7) 


To check that this is conserved, we must first write down the equations of 
motion for the system, which is especially easy since it is a free action: 


[8, + 06] = 0, 
(0, —0,]w; =0. (3.2.8) 
It is now easy to show 
a0) =e (3.2.9) 
Written out in components, this equals (Jz = 1(Jo 79) 
J_ = 500 (8; — 9) Xp, 
Jn = 5) (Or + 05) Xu. (3.2.10) 
Using the equations of motion, we can show that 
(dpe, )Jaee0. 
(0, —%F),/ 20. (Sei) 


In addition to the supersymmetric current, we also have the energy- 
momentum tensor, which as we saw earlier in (1.9.17) can be written 
as 


b6L 
i == (aly. aaa LSé’. ) 
\ 50,0 p ‘ (3.2012) 


This can easily be adapted for our purposes. Inserting (3.2.1) into (3.2.12), we 
find 


be te 
Tab = OX" 0, X y a qh Pad V =f qh Pda Vy a (Trace), 
Ogi); (213) 
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where T,,, is traceless because we have explicitly subtracted out the trace. 
Written out explicitly, this is 


To = Ti, 
cella i 
= 5 (xX? ot xe) _ qo ae Io) Wou im Vi =F Oo Wins 
To, = Tio (3.2.14) 


CESS; 
=X-X + qo a Oo) Wou a avi Gr “ Oo )Wip- 


Our basic strategy (3.2.5) is to use these currents to restrict the Fock space so 
that all ghosts are eliminated: 


ei (3.2.15) 


(We must stress again, however, that we are simply imposing this constraint 
ad hoc on the Fock space. Because the action is not locally supersymmetric, 
we cannot derive this constraint from first principles. Later, when we present 
the fully symmetric action, we will see that these constraints emerge because 
of a local symmetry.) 

Unlike the bosonic string, we actually have a choice of two distinct bound- 
ary conditions that we can place on the Wo and y; fields, either periodic or 
antiperiodic. At o = 0, we can always choose Yo = . However, ato = 7, 
we have the choice of two distinct boundary conditions. If the field 1s periodic 
(antiperiodic), we have the Ramond (Neveu—Schwarz) boundary conditions 
£9, 10}: 


Ramond: { Wola, T) = Wi(7, T), (3.2.16) 


Neveu—Schwarz: Wo(z, tT) = —Wi (a, T). | 


Given two different types of boundary conditions for our spinor, we naturally 
have two different ways in which to power expand these two fields in Fourier 
components with integer (half-integer) modes [8, 9]: 


oO 
- ESN GE ae is aS Oa (af aa 


R: — (3.2.17) 
i, (0,1) = Dae a dite-inr+o), 
lal (og a) = q-'/2 Ss bie" 
reZ+1/2 
os | (3.2.18) 
wi'(a, T) = q-1/2 phen ae 
eo . 


The next step in our strategy (3.2.5) is to isolate the algebra that these currents 
generate. In the previous chapter, we saw that the symmetries yielded the 
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energy-momentum tensor, which in turn generated the Virasoro or conformal 
algebra. For the NS—R action, we will find that the superconformal algebra is 


generated. 
With this decomposition of the fields, the moments L, of the current 7, 


can now be written as 
: : i —ine 
L, = on | do {e'""’ (To i To; ) +et (Too = To1)}. (322519) 
tT JO 


Notice that the definition of the conformal generators now includes the 


contributions from the anticommuting sector. 
We can repeat this Fourier expansion for the supersymmetric current as well. 


For the R sector, we have 


pia as ; : 
Pe v2 / dole" I, +e" J}. (3.2.20) 
XT Jo 
For the NS sector, we have 
Ohl us ; ; 
ee — v2 da{e"’ J, +e"? J_}. (322.20) 
HT Jo 


To extract the algebra from these moments of the currents, we need to construct 
the canonical conjugates of the fields. If we define 


) 
i z : (i222) 
OWa in 
then we find that the fermionic field is self-conjugate. Thus, we impose 
(v4a(0, 7), Wg(0", T)} = 15 4R8(0 — 0)”. (3.2.23) 
Therefore the oscillators in (3.2.17) and (3.2.18) obey 
R: Hd’) =n” 
{d) ’ di} Uy] | On,—m> 3 2.24) 
NS Be, Ds | Siero, oa 


It is important to notice that the zero component of the R sector is 
proportional to the Dirac gamma matrices: 


{dé d¥} =n”. (3.2.25) 


Hence, we will see that the R sector corresponds to the fermionic sector, and that 
the NS sector, even though it contains anticommuting operators, still remains 
a bosonic sector. The fact that boundary conditions like (3.2.16) play such 
an important role in the development of the fermionic and bosonic sectors 
of the theory is quite novel and apparently unique to the string theory. (This 
apparently obscure tact will play an important role when we discuss multiloop 
amplitudes defined over surfaces with holes. Then these boundary conditions 
will determine what are called the “spin structures” of the manifold.) 
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Once the commutation relations are established, we can construct the algebra 
that these tensors generate. As before, the moments of these currents will 
generate a closed algebra. The algebra for the NS sector is 

D 
[Ems Ln] = (m — n)Lingn + rae —m)3m—n: 
[Lm G,|= (4m a F)Gm+r, (3.2.26) 
{G,, G,}= 2Lbras a 5 D(r? an aoe: 
Explicitly, we have, in terms of oscillators in the NS sector: 


(bo ly . 7 hee 1) - : 
Le 2 +S A _nAmtn: + 5 ye (r + 5m) :b_-Dm+r?, 


n=—CO r=—-© 
Cp ON eee (07) 
n=—0O 


Repeating the same steps for the R sector, we have 


D 
[Lm» Ln] = (m —n) Linn + git fmm 
[Lens Pal = Gm — 71) Finn: (3.2.28) 
{Fins F,} = omen oF $Dm*5n,—n- 


Written out explicitly, we have 


(n+ im) edendnen 


WE 


1 lo. @) 
Lm = 5 SS nmin: + 5 


n=—00 n 


—0 


i — ee eae me (3.2.29) 
We can read off the Hamiltonian directly from the components of the energy— 
momentum tensor. In the conformal gauge, the Hamiltonian for the NS sector 
is 
: He aia Dy eer oy. (3.2.30) 
7 


r=1/2 


For the R sector, it is 
Co co 
Bia aa) Pen tO Ph. (3.2.31) 
n=1 m=] 


Notice that this Hamiltonian is diagonal on the Fock space of harmonic 
oscillators. This is extremely important, because this means that we can easily 
make the transition from path integrals to the harmonic oscillator formalism. 
The propagator for the theory becomes a simple integral over the exponential, 
which is easily performed. 
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k ,b4,, 10> 


FIGURE 3.1. Regge trajectories for the NS-R model for open strings. The boson 
states corespond to the Fock space generated by all possible products of a* and b* 
harmonic oscillators acting on the vacuum. The fermion states correspond to the 
Fock space generated by all possible products of a* and d~ oscillators acting on 
the vacuum, which is a spacetime spinor, not a scalar. On the left, the massless 
spin-1 particle corresponds to the Maxwell or Yang-Mills field (all tachyon states, 
including the vacuum, can be eliminated when we impose the GSO projection). On 
the right, the massless spin-4 fermion ‘s the supersymmetric partner of the massless 
vector field. : 


The states of the theory, as in the bosonic sector, are simply given by the 
Fock space of the product of these oscillators: 


NS eigenstates: I] | [fateh 10) , (3.2.32) 
nom 

Reigenstates: [|] |{a},}{d7,, 10) ua. (3.2.33) 
nyu mv 


where u, corresponds to an arbitrary spinor, which as yet has no restrictions 
placed on it. Some of the lowest lying states in the NS sector are as follows (see 
Bipsoel): 
vacuum 1) k? = 2, 
NS states: tachyon kybt 10), k=1, (3.2.34) 
massless vector a", |0), 
spin-+ fermion |0) uo, 
R states: spin-3 fermion d“, |0) ua, (3.2.35) 
spin-3 fermion a“, |0) ua. 

In summary, we find a remarkable similarity between the NS -R model and 
the original bosonic string. In cach case, we begin with an action, define its 
symmetries, generate its currents from the symmetries, construct the alge- 
bra from the currents, and then apply the constraints onto the Hilbert space. 


The main difference is the addition of the supercurrent, which generates the 
superconformal algebra (3.2.26) and (3.2.28). 


oon Drees Ht 
The last step in our strategy is then to apply this superconformal algebra 
onto the Hilbert space of the NS-R model. The ghost-killing conditions are 
Fr, Ora, jo) — 0 


for positive n and r. To establish the mass—shell conditions, however, we will 
now have to investigate the tree amplitudes for the NS-R model. 


3.3 Trees 


Once again, our starting point for the interacting string theory is the functional 
integral. Unfortunately, we have no guiding principle for the construction of 
the interacting theory except for intuition. As a guess, let us multiply the usual 
vertex term Vo with a factor of k,,y" at the point at which a spin-0 particle 
enters the diagram. Then a reasonable assumption is that the N-point scattering 
amplitude for this scalar particle is a generalization of (2.5.2) and is given by 


2 [ aupxdy []kvte 
i=1 


topologies 


a tu {[]huvireto™*, G3.) 
=| 


topologies 


Ae? 3 en 


where 


Dy =| [[ ]avute. 0). (3.3.2) 


aonn 


where we are now functionally integrating over an infinite sequence of 
Grassmann variables. 

As in the bosonic functional integral, we can remove the functional integrals 
at each intermediate point along the string sheet because the Hamiltonian is 
diagonal on this space. Thus, by using 


| = |X) |v) / DX Dy (Wl (X| (3.3.3) 


at each intermediate point, we can remove all the functional integrals, leaving 
only the harmonic oscillators. Thus, once again the functional integral permits 
a derivation of the harmonic oscillator formalism, which we view as only one 
particular representation of the functional integral. 

In oscillator language, the vertex for the emission of a scalar particle with 
momentum k,, becomes 


V =k’ Vo: (3.3.4) 


The underlying motivation for this choice is the rule that vertex functions must 
have conformal weight 1, in order to preserve the ghost-killing conditions 
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(2.9.5). From (3.2.27) and (3.2.28), we can calculate the commutator of L,, 

with y,,, and we find that it has conformal weight +. Since Vp has conformal 

weight a’k? and y has weight i, the weight of V is the sum of the two: 
cereal. 3 k= — C35) 

2a’ 

where a’ is equal to 7 This vertex function, which corresponds to the emission 

or absorption of a tachyon, is guaranteed to satisfy the correct ghost-killing 

conditions. 

The propagator is easy to calculate. Notice that the Hamiltonian is diag- 
onal in the space of harmonic oscillators. Therefore, we will use an explicit 
representation of the functional based on the normal modes of the harmonic 
oscillator. Then we have 


D= [ ete dr = 
0 


Lo—1 
Let us construct an N-point function of tachyons: 
(03 kil ky» DijpV(k2)D ++ V(kw—1)kw + b-1/2 |0; kw) , G37) 


where we have placed the tachyon states to the left and right of all the vertices 
and propagators. In this formalism, we can prove cyclic symmetry in the same 
way it was shown for the bosonic string. We first note: 


(3.3.6) 


Vik, 
lim en eae 

y>0 y 

lim (0; 0] yV(K, y) = (0; -k|k - bi. (3.3.8) 


This permits us to treat all external tachyons alike. By commuting the last 
vertex to the left, we can show, in exact parallel to the bosonic case studied in 
(2.6.24), that the amplitude is cyclic symmetric. 

At this point, however, we have a problem. The tachyon state that we have 
just constructed corresponds to k,,b", ,. |0;k), which satisfies 


[Lo Dib i rk) = 0. > Qs. (3.3.9) 


However, this means that the vacuum state |0: k) has even lower mass because 
it satisfies the following conditions: 


[Lo—1]|0;k) =0 > o’k? =1. (3.3.10) 


We are faced, therefore, with the unusual problem that the true vacuum of the 
theory (3.3.10) does not correspond to the tachyon that we have just constructed 
(3.3.9). In other words, the Hilbert space seems to be too large: the true vacuum 
is an unnecessary state. We seem to have two lowest lying states. 

The resolution to this puzzle comes from the fact that the vacuum state, 
indeed, is a redundant state that can be removed from the theory. This process 
is accomplished by redefining the Hilbert space of the theory. 
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The formalism that we have been working with is called the F, formalism 
and is a clumsy one. The vacuum of the theory is not equal to the tachyon, so 
the Fock space is actually larger than necessary. Although cyclic symmetry is 
easy to prove in this formalism, we would like to introduce a more stream- 
lined formalism, called F, [10], which works with the reduced Fock space by 
removing the vacuum state at k? = 1/a’. 

To accomplish this, let us rewrite the tachyon state as 


kde 12 10;k) = G_1/2 |0;k), (3.3.11) 
(05 kl ky bi) = (0;k| Gyr. (3.3.12) 


Now we rewrite the N-point tachyon scattering amplitude (3.3.7) using 
(B32) 


Ay = (0;ki| Gij2V(k2)D +++ V(kw_1)G_1/2 |0; ky) . (372213) 


Now comes the crucial step. We will push the G,2 to the right successively 
through various vertices and propagators. We need the formulas 


{G,, V} = [£2,, Vo] = [Lo +r — 1] Vo — Vo(Lo — 1), 
i 1 
oo GC . 3.3.14 
a ees 1/2 ( ) 
It is essential to notice that when we shove G1,2 to the right, we change the 
intercept of the propagator 


1 1 
a (3.3.15) 
Pr ae 


while all the other terms involving the L’s vanish as in the bosonic case. Finally, 
we have pushed G17 all the way to the right, where it vanishes on the tachyon 
state: 


Gi/2G_1/2 |0) = 2L9 — G_1/2G1/2) |0) = 9. (33.16) 
Thus, putting all terms together, we finally have 


An = (0;ki| ki - bi 2V (kz) 


+++ V(ky_i)kw + B12 |0; ky) . 


Lo — 1 


= (0; ki| V(k2) »++ V(kw—1) 0; ky). (3.3.17) 


pe 
Surprisingly, we have now completely rewritten the original amplitude such 
that the set of resonances has been shifted. In particular, the old vacuum state 
in the F, formalism at @’k* = | has vanished from the Hilbert space. It has 
decoupled completely in the new formalism, which we call F> [10]. Instead, we 
are left with the tachyon state being represented by |0; k), which now satisfies 
the new condition [Ly — 4] |0;k) = 0 with ak? = 

This is quite remarkable. The state |0;k), which used to represent the old 
vacuum state at a’k* = 1, has now suddenly transformed into the tachyon state 


Ni— 
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ata’k? = i Thus, in the F, formalism, the tachyon state and the new vacuum 
are the same particle. (We should be careful in noting that the same symbol 
|0; kK) can represent either the old vacuum in the F; formalism or the tachyon 
in the new F) formalism.) 

This shifting of the Hilbert space is a novel feature not found in point par- 
ticle field theories. In fact, when we discuss the conformal field theory in the 
next chapter, we will find that this strange “picture changing” phenomenon 
arises whenever we construct irreducible representations of the superconfor- 
mal group. Hence, it is not a trick, but an essential feature of the group theory. 
(We should also mention that when we make the model space-time supersym- 
metric, we will eliminate the tachyon state as well. Thus, the tachyon decouples 
from the true superstring Hilbert space, leaving a unitary theory.) 

Let us summarize the differences between these two formalisms: 


vemex = V, 
= 
F;: ea au 7 - oe > 4 (3.3.18) 
tachyon: k, - b_,/. |0;k) (@’k* = 5), 
vacuum: |0;k) (a’k? = 1), 
vertex = V, 
Fy: propagator = (Ly — $)"', (3.3.19) 


tachyon = Vacuum: |0;k) (a’k? = 4). 
The advantages and disadvantages of the two formalisms are as follows: 


(1) In the F, formalism, manifesi cyclic symmetry is much easier to prove. 
However, we must carry along the excess baggage of the vacuum state, 
which decouples completely. 

(2) In the Fy formalism, cyclic symmetry is obscure. but ghost elimination 
and gauge transformations are easy to perform. The advantage is that we 
are now working in a smaller Fock space. 


Using the F> formalism, it is not hard to evaluate the four-point function 
exactly: 


I Tue r(1 — a(s)) PO — a(t) 


0, VG MG Weyer 
(0, “lal Via) P(1 = as) — a(t)’ 


(3.3.20) 


where a(s) = 1+ a's. 
There is also a straightforward generalization to the N-point function. The 
N-point function is represented as 


Vik Vikw. 
es [dus (0:0) Ven Yn) 9.9), (ec 
yi YN 
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In general, this expression contains many factors that are tedious to work out. 
A simple method of obtaining the entire result is to use the relation 


vo) 
Ue 
Notice that if we power expand the exponential, only the linear term survives 


the 6 integration, and we wind up with the previous expression for the vertex. 
We have, at this point, done nothing. Now, let us use two identities: 


(0) 2d #02) 19) _ a 
JV sf 32 yi — y2 


10) = i) d6, dO, exp[ky - ky In(y1 — y» — 6162)) (3.3.23) 


= | aoexptik X + 0k-W//y). (3.3.22) 


Vin) V2) 
yo 4/ V2 


It is not hard to generalize the above relation to the N-point tachyon amplitude: 


or] [Yo = fTI- —Tlo-3 yj — 6,0)". (3.3.24) 


i) AV Gj 


(Of 


The advantage of this expression is that we can now read off the various terms 
that appear in the formula by successively power expanding and integrating 
over the Grassmann variables. 

In addition to tachyon—tachyon scattering amplitudes, we can also calculate 
the scattering of massless vector particles (which correspond to Maxwell and 
Yang—Mills particles). The choice of the vertex function for the massless vector 
particle is constrained by the fact that it must have conformal weight | and the 
correct spin. A natural choice for this vertex is 


Vite, k) = iG, C 5 welk*} =a = a Wk - eb: 


where ¢ is the polarization vector of the vector particle and k? = ¢ -k = 0. The 
conformal weight of the vertex is equal to the sum of the conformal weights of 
its individual factors. Since has conformal weight + and e‘** has conformal 


weight a'k?, this means that the conformal weight of this vertex is given by 
5 tates Ose, 


which is the desired weight. This vertex is guaranteed to satisfy the G and L 
ghost-killing conditions. It is not difficult to calculate the N-point scattering 
amplitude for this massless gauge particle using the same formalism developed 
for the tachyon. For example, the scattering amplitude for four massless gauge 
particles is given by 


r(—45)P(—31) 


A = a 
oe rerers Sy 
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where the kinematic factor K is given by 


K = —4(stei324 + subos14 + tub12;34) 
+ 4s(kyak32boa + koskar Sis + kiskaoSo3 + koaks1 614) 
+ 5t(kyika3b31 + k3aki2604 + koakizos4 + k31ka26i2) 
+ Su(kirkazo32 + kaakoi S14 + kiak23834 + ks2kai 612), 


where 
kij = & - ky, ci = Si Sj, bij: = Sig Sk. 


(The scattering involving fermions can also be calculated; they will also have 
the basic form, except that the kinematic factor K will depend on the external 
spinors.) 

We should add that another advantage of working with the F> formalism is 
that we can explicitly show the invariance of the amplitude under the following: 


oy, ae, 
56; = €. (3.3.25) 


This generates the group Osp(1, 2) (see Appendix), which is the supersymmet- 
ric generalization of the projective group SL(2, R). This group is generated by 
the algebra formed by the set 


Gitp. La, “Le: (3.3.26) 


Let &2 be an element of the group Osp(1, 2). Then the proof of superprojective 
invariance can be shown by noting: 


Q|0:0) = |0;0). (3807) 


(Notice that, as in the bosonic case, the vacuum state |0:0) is not a physical 
state. Thus Q, which does not in general kill physical states, can annihilate that 
vacuum state.) The vertex rotates as 


QV(y, AQ? = V(y’, 6’), (3.3.28) 


where V(y. @) is the vertex function before we integrate out over @. 

Now that we have established the properties of the three-boson vertex func- 
tion in the Neveu-Schwarz formalism, the fermion—fermion—boson coupling 
(with an external boson line) for the Ramond model can also be calculated. 
We choose 


V(k) =P ze**:, (3.3.29) 
where 


P= yjy(—1entren (3.3.30) 
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where y\; is the product of the Dirac matrices. In addition to this vertex 
function, we also need the propagator: 


me! Fo 

ae eee a 
where Fy is given in (3.2.20). We have chosen the intercept to be zero (which 
we will see is the proper choice in order to guarantee conformal invariance; at 


this early stage, however, we cannot yet motivate the choice of zero intercept.) 
Finally, the vacuum state is now a spin-5 fermion, given by 


Y> 10; 4) q Ha. (3.3.32) 


(Gen 


where we sum over the spinor indices a of the spinor u,. The scattering 
amplitude for a fermion interacting with several bosons is now given by 


u(qi) (0; qi] V(k2)D «+» DV (kw-1 |0; gn) u(qn). (3.3.33) 


So far, we have only exhibited amplitudes with two external fermion lines. 
Surprisingly, our first guesses, based on simple intuition, have all been suc- 
cessful. Because the action in the conformal gauge was that of a free theory, 
the NS—R model has been exceedingly simple. There is a price to be paid for 
this, however. 

In principle, because the string model can be factorized in any channel, 
it should be possible to factorize the R model in the meson channel and 
rederive the NS model or to extract out the multifermion vertex function. 
Actually, this is quite difficult. In particular, the fermion vertex function (with 
an external fermion leg coupled to internal meson and fermion lines) is an 
exceedingly difficult object to work with. This, in turn, makes it difficult to 
calculate multifermion amplitudes in the NS—R formalism. 

Although it can be shown that the R model can be factorized in the various 
channels to derive the NS model, the NS—R formalism is actually quite clumsy 
when calculating multifermion amplitudes. In the next chapter, we will see 
that the techniques of conformal field theory make the covariant calculation of 
multifermion amplitudes possible. 


3.4 Local Two-Dimensional Supersymmetry 


Notice that we imposed the following conditions by hand: 
(Tap) = 9, (3.4.1) 
(Ja) = 0. 
No justification was given. We merely appealed to the fact that there must exist 
a higher action from which these constraints can be derived ab initio. 


We will now describe the local generalization of the previous theory. The 
key to the construction of this locally sypersymmetric action is to add more 
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fields to the theory. In addition to the supersymmetric pair 
(Xp, Vu) (3.4.2) 


we introduce a two-dimensional vierbein into the theory and also its 
supersymmetric partner: 


(5; Xai (3.4.3) 


where Greek letters such as a and f label two-dimensional vectors in curved 
space, Greek letters like jz and v continue to label 10-dimensional space-time 
vectors, Roman letters such as a, b, c label two-dimensional vectors in flat 
space, where x, is a two-dimensional spinor as well as a two-dimensional 
vector, and where we have suppressed two-dimensional spinor indices. Thus, 
both the vierbein and the spinor x, have four components, as required by 
supersymmetry. Let us now construct the complete action, first written down 
by Brink, Di Vecchia, Howe, Deser, and Zumino [1 1—13]: 


l .7 7 - Oop 
aor —Frgi Ble aX" OpX — Yl Vay + 2Xap*p yt OpXy 
+ dW" Xap? exe). (3.4.4) 


Notice that the p matrices used above are actually defined in two-dimensional 
curved space because they are multiplied by the vierbein: 


Dp = ep. 
This action is invariant under 
Ok Sse. 
dw4 = —ip%e(0,X" — w" x,), 
2D SUSY: 0 Yi Xa) (3.4.5) 


be, = —2iép* Xz, 
8 Xa = Vie. 


It is also invariant under Weyl (scale) transformations: 


px 
sy! = —toy", 
Weyl: a = (3.4.6) 
de, = 0€4, 
8Xa = 40 Xa. 
It is also invariant, due to certain identities in two dimensions. under 
ONG — IPaN, 
de, =OUaox = 0, (3.4.7) 


Finally, by construction, it is also manifestly invariant under local two- 
dimensional Lorentz transformations and reparametrizations because of the 
presence of the vierbein. (The presence of ordinary derivatives like a, in the 
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action instead of curved space covariant derivatives D, is due to the fact that 
the two-dimensional connection field vanishes in the action.) 

Now we are in a position to derive the constraints that we imposed on the 
Fock space by hand. Notice that the variation of the action with respect to the 
vierbein yields 


0 (3.4.8) 
while variation of the vierbein’s supersymmetric partner yields 
pp? aX, w" — H(w)e% p? xq = 0. (3.4.9) 


These are the constraints that we promised earlier, which are now derived as a 
consequence of the variation of fields. 
We can now take the gauge: 


Cae, 
Xa = 0. (3.4.10) 


Notice that there are enough symmetries on the x, field to eliminate it entirely, 
a combination of two-dimensional supersymmetry (3.4.5) and the symmetry 
in (3.4.7). Then the constraints reduce to 


Tap = 9X ,9gX" + 4iW padp — (trace) = 0, (3.4.11) 
=p p'osX,y" 0; (3.4.12) 


(where the parentheses symbolize taking one-half of the symmetrized sum). 
These, or course, are the constraints we originally introduced at the beginning 
of our discussion in (3.2.7) and (3.2.13). Thus, we now have derived these 
constraints from an invariant action, rather than simply imposing them on the 
states of the system. 

In summary, this action is invariant under several local symmetries: 


(1) Local two-dimensional Lorentz symmetry; 
(2) Reparametrization; 

(3) Weyl rescaling; 

(4) two-dimensional supersymmetry; 


but not 10-dimensional spacetime supersymmetry. 


3.5 Quantization 


Quantization of the NS-R action is a straightforward extension of the quanti- 
zation of the bosonic string. The surprising feature is that the highly coupled 
NS-R action becomes a free theory after choosing the gauge. 

Again, we will use three methods: (1) Gupta—Bleuler, (2) light cone, and 
(6) BRS 
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Gupta—Bleuler 


The Gupta—Bleuler formalism is the simplest of the three and is the formalism 
we have actually been using all this time. Using the large set of symmetries in 
the model, we can choose the conformal gauge: 


Xa = 0, 
ee (3.5.1) 
é, = 4). 

Notice that in this gauge the action reduces to the one that we have been using, 

with the provision that we impose 


; )=0, 
NS: one (3.5.2) 
G&G |W) = 0, 
: F, |W) = 0, eer 


for positive n, m, and r. Thus, we have been quantizing the NS—R action in 
the Gupta—Bleuler formalism. 


Light Cone Quantization 


The essence of the light cone formalism is that we eliminate all unphysical 
ghost states by solving the constraints (3.4.11) and (3.4.12) explicitly in terms 
of the transverse or physical modes, rather than applying them onto states. We 


can impose: 
1 
= 
y : (3.5.4) 
W'(a, +9,)X, = 0. 
Most of the generators are only slightly changed. The real difference occurs 


in the — components of the fields, which contain the transverse components 
of the conformal generators: 


=] = l = | wai ad 1 ad, 9 p—_ 
a, = OR 00+ 5 (eS 20 Mie ar (3.5.5) 


Fro-N 


where the first term on the right refers to the bosonic generator we found earlier 
in (2.3.16), and 


by = — 5) ee (3.5.6) 


a ce é ' 
es" ¥ 2 3 ‘a Ct lh, Sl i Tay). (3.357) 


a 
P MESH f=] fewamo 
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We find, finally, 


[M~, M7] 


ata; ay ea =a ea A, 7). (3.5.8) 


Daal Die? 
A, = - —_——_}. 5: 
n( : ) +5 (2a : ) (3.5.9) 


Thus, Lorentz invariance is achieved only with D = 10 anda = 


where 


Nl— 


BRST Quantization 


BRST quantization begins with the calculation of the Faddeev-Popov de- 
terminant associated with the conformal gauge. Because of the large set of 
symmetries of the action, we can enforce: 


eo. 
Xa = 0. (3.5.10) 


As before, we must analyze the constraint when we make a variation of the 
fields in (3.4.5) and (3.4.7) 


Oe = Veet lDxt 3.11) 


As before, the Faddeev—Popov determinant (1.6.10) associated with the gauge 
constraint is given by 


5 
Arp = det E vee" | : G2:12) 

dé 
(The n field does not contribute to the Faddeev—Popov ghost determinant.) The 
Faddeev—Popov determinant associated with the anticommuting sector is thus 
det[ Vy] = det[V_] det[V-]. (325913) 


Analyzing this is a bit tricky because of the number of degrees of freedom 
within x,, which is both a two-dimensional spinor and a two-dimensional 
vector, and thus has four components. 

As before, the easiest way to calculate the determinant of an operator is to 
exponentiate it into the action: 


det[V,] = / DBDye~™, 


if = 
Son = = | a°zBiy Geo. (3.5.14) 


where now the ghost fields 8 and y are commuting variables. (Although this 
action looks similar to the previous ghost action (2.4.4) found for the bosonic 
string, it is essential to point out that these new ghost fields 6 and y transform 
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differently under the conforma! group than the b, c ghosts because they were 
derived from variations of a spinor x, rather than the tensor g,,. We will discuss 
this further in the next chapter.) 

The complete action, therefore, is now the sum of the original action with 
the sum over the anticommuting ghosts b, c and the commuting ghosts f, y. 

We can easily calculate the energy-momentum tensor and the supersym- 
metric current for the superconformal ghosts. Taking their moments, we arrive 
at the super-Virasoro algebra: 


© oO 
L& = > (m+n):bnenCrit D> ($m +2) :Bm—n¥nis 


m.n=—0o mn=—0Co 
woo) = 
F® = —2 S* diaYmint > (n—m)c-nBmin- (3.5.15) 
m,n=—0o mn=—CoO 


Similarly, the combined action, including the ghost contribution, is sull 
invariant under a nilpotent transformation. The generator of this nilpotent 
transformation is given by Q, where [14-16] 


fo @) i le. @) 
Q= » (LinCn + FenYn) — 5 a Gi — f):e 460m: 


n=—0O mn=—C 


= 3n = 
fe Ss (F + m) Cone aren = > Von Voneaen — aco, 
mn=—-Co 
(3.5.16) 


where the L and F generators are functions only of the w and d oscillators. If 
we let Q? = 0, then we arrive at 


D= IG, 


I 
3 (NS), 
= es By ° 
a 0 G51) 
The physical condition is now 
Q |phy) = 0. (3.5.18) 
Solving this, we arrive at the usual physical state conditions: 
{io = 0. 
NS: - 
G, 16) =0, iia (3.5.19) 
(Lo) lo) = 9, 
R: ; (3.5.20) 


F,, (oy=0. m= 0. 


Let us now calculate the anomaly contribution of these oscillators. We will 
find that the two contributions add to yield a vanishing anomaly, but only in 
10 dimensions. The ghost contribution yields 


(Fe PM") = 2ER ome G2) 
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From the R sector, the anomaly contribution was 5 Dm? + 2a. Thus, we have 


5 Dm* + 2a — 5m? =0 (3.5.22) 
which fixes 
D110. 
a0: 35:25) 
Similarly, for the NS sector, the anomaly is equal to 
(Gp Gy =2L, 47 —0r°. (3.5.24) 


This must cancel against the anomaly coming from the bosonic sector: + D(t? = 


t) + 2a. Thus 
SD(r? — 4) +2a+4—-5r? =0 (3.5.25) 
which vanishes if 
/D) = iO. 
Gi ‘. (33,26) 


3.6 GSO Projection 


The action (3.4.4) for the NS-R model was not invariant under 10-dimensional 
space-time supersymmetry. Although it is not obvious, the NS—R theory is 
also invariant under 10-dimensional space-time supersymmetry if we make a 
certain truncation of the Fock space. To see this, let us first analyze the space 
of states of the string. 

The total number of bosonic states at the mth level is equal to the partition of 
the integer n, that is, p(). To see this, let us first take the example of the fourth 
level. The total number of states that are eigenfunctions of the Hamiltonian at 
that level is five 


(ami) Ole Mame)” |0),  saesyiaes 10). 

(a_1)’a_ |0), a_4|0). 
Notice that this is precisely the total number of ways in which the integer four 
can be broken up as the sum of other integers, i.¢., the partition of four. In 


general, each state at the Nth level can be represented as one of the various 
ways we Can partition an integer: 


lo) =a" a ah ..-a™™ 10), (3.6.2) 


he tg) i) —nNM 


(3.6.1) 


where 


R|¢)=N |), 
M 
i = yon. (3.6.3) 
i=1 
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and R is the number operator for this space. 
In statistical mechanics, we often introduce the partition function that counts 


how many states there are at a given energy: 


Zee Ih) (3.6.4) 


We find the partition function useful because it will enable us to count the 
number of states of the string: 


Z = Tr(x*), (3.6.5) 
=e, (3.6.6) 


The coefficient of x” in this power expansion is nothing but the partition of NV, 
that is, p(NV). This trace can be performed in a number of ways. The simplest 
is to introduce coherent states: 


= / dd. ddte~*l jay (al. (3.6.7) 
Thus, the trace can be represented as 
Va / dante" (Alx® ia). (3.6.8) 


Performing the integral, we find 
Z| eee ee (3.6.9) 
n=1 


As a check, we can power expand this function (in one dimension) and show 
that each coefficient of x” is equal to p(N), ie., 


Z= > pix’. (3.6.10) 
=] 


This function is related to a famous function in mathematics, the Hardy— 
Ramanujan function. In fact, some of the miraculous properties of the string 
model are related to the identities of the Hardy—Ramanujan function. 

Next, we would like to compute the states of the NS and R sections to see 
if they are supersymmetric. At the very least, we want the number of on-shell 
states to be equal in the two sectors. 

Notice that the NS—R Fock space actually divides up into two smaller Fock 
spaces, depending on whether a given state has an even or odd number of b 
oscillators. Because the b oscillators are anticommuting, the total number N 
external lines must be even, because b oscillators must be contracted in pairs. 
Thus, we can define a “G-parity” operator that simply counts the number of 
b oscillators in a given state. It is —1 for an odd number and +1 for an even 
number: 


G = (—1)2en, | Get) 
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If we take the even G-parity sector of the theory, this eliminates the tachyon 
immediately, leaving the massless vector particle as the lowest lying state, with 
10 — 2 = 8 physical states. 

Meanwhile, the R sector can also be reduced. If we demand that our lowest 
lying massless fermion is Majorana—Weyl, then it has only 16 surviving modes. 
The counting goes like this. A Dirac spinor in D dimensions has 2!!/2)? = 
32 complex components. Demanding that it be Majorana (real) reduces this 
number by half, and demanding that it be Weyl reduces this number again by 
half. Let us now calculate the partition function that counts the total number 
of states at each level. For the NS sector, we have 


[e.@) 
ins = a dim V,,x” 
=0 


=e Ist) Gx], (3.6.12) 
where 
CO le, ¢) 
R=) nata,+ )° rbfb,. (3.6.13) 
n=1 Pali 


The trace can be evaluated exactly, leaving us with 


ie?) oO oO 
feaox (dae) Te +x"'2)§ TT - =| 
n=1 n=] n=! 


(3.6.14) 
By contrast, the trace over the R sector becomes 
fp 8 ince) (3.6.15) 
where 
CO 
R=) n(ala, + d}d,). (3.6.16) 
n= 
We find 
fa =8[ Ja —2x"y*a +2". (3.6.17) 


nl 
At first, we might suspect that there is no relationship between these two 
sectors. However, it was known to Jacobi as early as 1829 that these two 
expressions are precisely equivalent: 


fus = fr- (3.6.18) 


This, of course, is not a proof that the reduced NS—R model is supersymmetric, 
but it is a necessary condition. This projection is called the GSO projection 
(after Gliozzi, Scherk, and Olive [17]) and it has a profoundly important role 
to play in the superstring. (In Chapter 5, we will find the GSO projection is 
equivalent to modular invariance of the closed superstring one-loop amplitude.) 
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Thus, one of the great disadvantages of the NS-R approach is that space— 
time supersymmetry is very obscure. A second great disadvantage of the NS-R 
model is that the vertex function for the emission of a fermion is very difficult 
to work with and does not have nice properties under the Virasoro gauges. As 
a result, it took many years before two-fermion scattering amplitudes could be 
worked out. In the NS-R theory, the vertices for boson and fermion couplings 
are [18-22] 


Vep = kw" Vo, 
Ver = vi1(—1)%"4"" Vo, (3.6.19) 
Ves = eHiWwvo, 


where L4 is a Virasoro generator and 


oo = 
W = (0|p exp |, 5 ele ( a“ 2 Ja. 


m=0,r=1/2 


lee pS se L — 
Pewee ays = = b* b, U) Fs 
on 1 me ) Gliese ; af )F 


(3.6.20) 


where this vertex takes us from the bosonic Fock space to the fermionic Fock 
space because we have a bosonic vacuum on the left and a fermionic vacuum 
on the right. Unfortunately, this vertex has conformal spin ;, making it difficult 
to construct an amplitude that is conformally invariant. 

Thus, although the NS—R model is appealing because it is essentially a free 
theory in the conformal gauge, the problems with the fermionic sector force 
us to go to higher versions of the theory. Next, we will investigate the Green— 
Schwarz (GS) action which is equivalent to the GSO projected NS—R theory 
and is manifestly space-time supersymmetric. 


3.7 Superstrings 


Although the NS—R was simple and elegant, it displayed only two-dimensional 
world sheet supersymmetry and not genuine 10-dimensional supersymme- 
try. Let us now present the Green—Schwarz action, where the 10-dimensional 
supersymmetry is manifest [23, 24]: 


See dodt{ /gg"* Tl, «My + 212” apn yane! = 7" ae) 
— 26% 6'T#9,0'671,,8¢07}, (3.7.1) 
where 


Tee opx! = 76a. (372) 
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where the two 6“ (A = 1, 2) are genuine space-time fermion fields (rather 
than space-time vectors as in the NS—R model). These 10-dimensional spinors, 
however, transform as scalars in two dimensions, not two-component spinors. 
This action, like the supersymmetric point particle action (3.1.1), is invariant 
under global supersymmetry: 


ie 
ee aoa | (G73) 
However, in the proof, we have to make use of the identity 


Py Wa vol" Ws) = 0. (3.7.4) 


To prove this identity, we must make use of a Fierz transformation. It turns out 
that this identity holds only under the following conditions: 


(1) D =3 and the fermions are Majorana; 

(2) D = 4 and the fermions are Majorana or Wey]; 

(3) D = 6 and the fermions are Wey]; 

(4) D = 10 and the fermions are both Majorana and Weyl. 


The standard Dirac representation is complex and exists in any space-time 
dimension. The Majorana representation of the Dirac matrices is one where 
they are all real (or imaginary). Thus, they have half as many components as 
the Dirac representation. They exist only in D = 2, 3, 4 mod 8 dimensions. A 
Weyl representation of the Dirac matrices is one where we have projected out 
half the components by the Weyl projection operator: 


F(1+T p41), Pp41 = Vol --:Tp-1. 


The Weyl representation exists only in even dimensions. And lastly, spinors that 
are simultaneously Majorana and Wey] are defined only in 2 mod 8 dimensions: 


Majorana—Weyl spinor: D =2mod8. 


For our present case, we will choose Majorana—Weyl spinors in 10 dimensions. 

To prove that the action (3.7.1) is locally supersymmetric in 10 dimensions, 
let us first construct the following projection operators, which project onto 
self-dual and self-dual pieces of two-dimensional vectors: 


Pee =e) </e). (3.7.5) 


This projection operator has the following properties: 


; 
Pee Po 
Pi? 95, PX’ =0, 
la _ po, 1 
c*= PP Kp 


er Ka, 
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where the supersymmetric parameter is given by AC? auere Avcoulvor 20 
represents a two-dimensional vector index, and a is a spinorial index in 10 
dimensions. We will usually suppress the spinorial index. Notice that the last 
two equations state that « is self-dual (anti-self-dual) for A = 2 (1). 

Then the action can be shown to be invariant under 


b6° = 2 - Tk: 
bX" = i641 #864, 
5(./gg%?) = —16./e( Po” e360! + PL” K7F3,67). (3.7.6) 


The great achievement of the GS action is that it explicitly contains 10- 
dimensional supersymmetry. Because the fundamental field is now a genuine 
anticommuting space-time spinor (rather than the anticommuting vector 
field in the NS-R theory), we can explicitly construct the space-time 
supersymmetric operator Q,. 

The great defect of the presentation, however, is that naive covariant quan- 
tization, as in the point particle case, does not exist. Once again, as in (3.1.9) 
for the point particle case, we must face the fact that the quantization relations 
are nonlinear because 


i= ae iP“0,X,04 + +++. Gia 
bWa 
As a consequence, the commutation relations between fields are highly non- 
linear. Worse, we find that the commutation relations are actually proportional 
to the inverse of the constraints; 1.e., they probably don’t exist [2, 24-32]. 

As a result, we will be forced to use light cone quantization. 


3.8 Light Cone Quantization of the GS Action 


The counting of the number of independent degrees of freedom within the 
spinors is important when we fix the gauge. Normally, Dirac spinors in D 
dimensions (D even) have 2''/*)? complex components. Thus, the spinors 6! 
in 10 dimensions have 32 + 32 complex components. However. when we 
restrict them to being Majorana spinors, we are left with half as many: 32 + 32 
real components. Restricting them to being Weyl spinors further reduces them 
down by half once again, to 16 + 16 real components. Choosing the light cone 
gauge will reduce the number of independent components down further to 8 +8 
real components. Lastly, when we go on-shell and impose the Dirac equation 
on these spinors, the number of independent components goes down by half 
again, to eight. But this is precisely the number of components necessary 
to form a supermultiplet with the eight bosonic components of the string X,. 
Thus, we have precisely the correct number of components necessary to satisfy 
supersymmetry on-shell. If N is the number of components in the spinors 4! 
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and 67, then we have 


Dirac: N = 32+ 32 complex components, 
Majorana: N = 32 + 32 real components, 
Majorana—Weyl: N = 16+ 16 real components, 
Light Cone: N = 8+ 8 real components, 
On-shell: N = 8 real components. 


Let us now begin this reduction down to the light cone gauge. We will choose 
the gauge constraints: 


ge =), 
bees eer’). (3.8.1) 
Because these gamma matrices satisfy 
i y= yao (3.8.2) 


this means that exactly half of the original 16 + 16 components of the spinors 
are eliminated. The final light cone action becomes remarkably simple in this 
gauge: 


—l . : = 
Se — | a dt(o,xX0 X= iSp’a,S), (3.8.3) 
47a’ 


where we make the substitution 


Bo Ss (3.8.4) 


The essential point is to observe that all the complicated nonlinear terms in 
(3.7.1) that prevented a simple covariant quantization of the superstring have 
now disappeared. (Notice that a curious phenomenon has also occurred. In the 
covariant GS action, 6! and 67 in two dimensions were scalars, independent of 
each other. Now, in the light cone gauge, these two independent scalars have 
merged into a single two-dimensional spinor.) 

Quantization is exceptionally simple because the system reduces to that 
of free particles (while the covariant theory was intractibly coupled). The 
equations of motion are those of free strings: 


(0, + Or) Sas = 0, 


(8, — 4,)S72 = 0. (3.8.5) 
The commutation relations are 
{S42(a, t), S?°(o', t)} = 18°54? 5(0 — 0’). (3.8.6) 


However, we can have, as before, several boundary conditions on these fields. 
For an open string (Type I), we have the following boundary conditions: 
$0. 1). = "0, 2); 


3.8.7 
$0, t) = S?*(x, T). ( ) 
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Notice that these two string fields have the same SO(8) chirality. This is 
because only these relations are compatible with a global supersymmetry 
transformation that equates the two supersymmetry parameters e“. This means 
that the open Type | superstring has only N = | supersymmetry. (Reversing 
the signs in the previous equation, yielding opposite chiralities, would make 
supersymmetry totally impossible.) The normal mode expansion thus reads 


lo 4) 
Sl¢tG. t) = q-1/2 Ye Sag ea 
S74(0, 2) 2a ss cee (3.8.8) 
where 
(S? , S257 o ee (3.8.9) 


For the closed string (Type II), however, we actually have two types of 
options; the fields can either be chiral or not. Closed strings are, by definition, 
periodic in sigma, which yields the following normal mode expansion: 


SG, T) = Ss Saath 
ae (3.8.10) 


0 
AGE t) = SS Seen), 


n=-X 


If these two fields have different chiralities, then they are called Type ITA. If 
they have the same chiralities as in Type I strings, they are called Type IIB. 
Les, 


open and closed string, 


Type I = 
ke | same chiralities, 


closed string, 
TypentA = ane (3:8.11) 
opposite chiralities, 


Type IIB = closed stiitly, 
same chiralities. 


The spectrum of the light cone superstring is especiaily appealing because 
the theory is on-shell supersymmetric, meaning that we should immediately 
see all particle states arranged such that the helicity states of the bosons 
match the number of fermions. The mass of each particle is determined by 
the Hamiltonian: 


mt STU 


N =a'(mass)* = ae +nS* S°), (Ges a PAI 


P| 
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For Type I, the ground state of the theory consists of the massless vector 
particle and its spinor partner. Let |i) represent the eight physical transverse 
polarizations of the massless vector field. Then the spinor partner |a) can be 
represented as 


i 
ia) = g (vi So)" lees (3.8.13) 
We can normalize our states as 
(i]j) = dy, 
(alb) = }(hy*), (3.8.14) 


li) = = (Sovis)" la), 


where / is the Weyl projection operator. (We will use the notation that 
yui2--H» is equal to the product of gamma matrices summed over all 
permutations in the indices. The normalization is such that y'? = y'y?.) 

If we were to quantize the 10-dimensional super Maxwell theory in the light 
cone gauge, then the supersymmetric pair (A,,, YW") would correspond to 

tric multiplet al (3.8.15) 
supersymmetric multiplet: . S. 
P P w? — |a). 

Thus, the light cone theory reproduces the super-Maxwell theory at the lowest 
level. 

At the next level, we have 128 boson and 128 fermion states: 


a, |j) — 64 states, 
128b : 3.8.16 
oy S*, |b) — 64 states, ( ) 
i 1b) > 64 states, 
128 fermions: ek (3.8.17) 
S* |i) > 64 states. 


At the N = 2 level, we have 1152 bosons and an equal number of fermions: 


a’ a, |k) —> 288 states, 

z Sas |i) — 224 states, 
1152 bosons: a',|j) > 64 states, 

a'_,S“, |b) — 512 states, 
S*, |b) — 64 states, 

a!_,S*,|j) — 512 states, 
S*, |i) —> 64 states, 

1152 fermions: a'_,a/ , |a) > 288 states, (3.8.18) 

a’, |a) > 64 states, 

S2,S°, |c) > 224 states, 
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This procedure can be repeated at the next level, where we have 15,360 
states. 

Not surprisingly, we can also regroup these massive multiplets accord- 
ing to O(9). This is because massless D-dimensional supergravity, when 
compactified, yields D — 1 massive states. Thus, the N = | sector, with 
128 bosons and 128 fermions, can be regrouped as 44 + 84 bosons in ir- 
reducible O(9) representations, and a single spin-3 128 multiplet for the 
fermions. At the N = 2 level, we have the bosons being regrouped as the 
9+ 36+ 126+ 156 + 231 + 594 representations of O(9), while the fermions 
can be reassembled in 16 + 128 + 432 + 576. 

For the Type II closed string, the spectrum becomes even more interesting, 
because we obtain supergravity at the massless level, with 128 bosons and 128 
fermions: 


128 Bosons |i) |j) ;|a) |b), 


128 Fermions |i) |a) ; |a) |i) , (3.8.19) 


SUSY multiplet: | 


where the first state refers to the S oscillators and the second to the S oscillators. 
If the fermions have the opposite handedness (Type IIA), then this represents 
the N = 2, D = 10-dimensional reduction of ordinary N = 1. D = 11 
supergravity. 

However, if the fermions have the same handedness, then there are com- 
plications. This theory contains a fourth-rank antisymmetric tensor in 10 
dimensions, with 35 independent components in eight dimensions. It has been 
shown that no covariant action for such a particle exists! Thus, we have the 
unusual situation where a supersymmetric action does not exist for this sec- 
tor. The theory, of course, is still well defined. The light cone theory and the 
S-matrix elements are explicitly calculable. However, the S-matrix, strictly 
speaking, does not seem to arise from a covariant action [33]. 

Finally, if we take the restriction to symmetrized states of the Type I theory. 
we obtain N = |, D = 10 supergravity. Let us summarize the zero slope limits 
of these theories: 


TypeI > N =1, D = 10 super-Yang—Mills, 
TypeI > N =1, D = 10 supergravity, 
Type HA > N =2, D = 10 supergravity, 
Type IIB — nonexistent. (3.8.20) 


At higher and higher levels, it becomes prohibitive to continue this analysis 
of the spectrum to show supersymmetry. We can prove, however, that the entire 
spectrum, at arbitrarily high orders, is supersymmetric simply by showing 
that supersymmetry generators exist with commutation relations that commute 
with the Hamiltonian. We will now show this by explicitly calculating the 
supersymmetry generators to all orders. 
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In the light cone gauge, the two supersymmetry transformations (3.7.6) and 
(3.7.3) become quite simple: 
5S* = (2pt)'?7", 
OX). 


(3.8.21) 


5S? = —i/ pt "0, X' y's, 
(eS (py l4ys £4 94, 


(See the Appendix for a representation of the y matrices. There are three 8 
representations of SO(8). The first is a vector representation 8, which we denote 
by the index 7. The other two representations are both spinorial. We use the 
index a for one representation and a for the other. Thus, the y matrix has indices 
that transform as: y;,,.) We can find an explicit form for the two supersymmetry 
generators in terms of the fields that reproduces the transformations given in 
(228221); 


ae (Gpivese 
C= (ae yi 5 Sora (3.8.22) 


n=—CO 


It is now a straightforward task to calculate the anticommutation relations 
between these generators 


(0", 0") = 2pte”, 
(07, 0} = 2y',p', (3.8.23) 
(Q*, 0°} = 2H3", 


where 
= pee +nS?,S¢)+ip?}. (3.8.24) 


The proof that the sy cone theory is Lorentz invariant is now expanded to the 
proof that the superstring is super-Poincaré invariant. In addition to the usual 
commutators, we must show that 


; [M*”, Ou) ~ —4i(y"” Op, (3.8.25) 


where a, f represent spinor indices in 10 dimensions. Again, the only difficult 
commutator involves the — component of the oscillators: 


a, = ae a (ana. tr (m a in) Sos.) : (3.8.26) 
The difficult commutator has an extra piece: 


Mi = my s By Sec (3.8.27) 


n,m=—CO 
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Because the theory is already uniquely defined to be in 10 dimensions, we find 


[a (3.8.28) 
Furthermore, we also find the commutator for the super-Poincare group: 
[M—, 7) = —y', OF, (3.8.29) 


F 


This complete the proof that the spectrum of the GS model is 10-dimensional 
supersymmetric to all levels. 


3.9 Vertices and Trees 


We now turn to the question of the interactions. We will have to rely on several 
educated guesses as to the nature of the vertex functions, but demanding that the 
vertices transform under supersymmetry places stringent conditions on their 
final form. Having established space-time supersymmetry for the superstring, 
let us now show that this restriction is sufficient to allow us to construct the 
trees. We will demand that the supersymmetry operators take us from fermion 
vertices to boson ones and vice versa. This will impose an enormous number of 
constraints on our theory, which will, in effect, uniquely determine the theory 
itself. 

We demand that, under a supersymmetric transformation generated by the 
Q’s, the vertices Vr and Vz transform into each other 


[n*O%, Ve(u, k)] ~ Va(E, k), 
[nO Vell. k) sent). 
[e* 0%, Vr(i, k)] ~ Vat, &), 
[e*O*%, Ve(C, k)] ~ Ve(a, &), (3.9.1) 


where ¢ stands for the polarization tensor of the massless vector particle, 

and u stands for its Majorana—Weyl spinor partner. Notice that these rules 

are complicated by the fact that a supersymmetric transformation necessarily 

generates rotations in space-time, so that the ¢ and wu must also transform 

properly into ¢, f, and i, i as we make this supersymmetric transformation. 
A natural assumption is that these vertices can be expressed as 


Ve(C. kK) =o Bel®*. 


Ve(u, k) = uFe*%, Ses 


for some vector B and spinor F. Remarkably, when all the details are worked 
out in the light cone gauge, we can satisfy all these conditions for simple values 
of B and F. 

We begin by first calculating the rotated spinors and polarization tensors. 
To calculate how the polarization and the spinor transform, we take the ze- 
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roth component of the supersymmetric generators and see how they transform 
states. At the zeroth level, we find 


fo .. Op)? s-. 
Cop yey pas (6.93) 


These zeroth components transform the polarization tensor into Majorana— 
Weyl spinors. Let us define 


n° Q* |u) = |), 
n° Q* |g) = la), 

3.9.4 
s*Q* |) = [a), a, 
e*Q*|u) = |£). 


Now insert (3.9.3) into (3.9.4) and solve for the transformed spinors and vec- 
tors. It is now a simple matter to show that the rotated spinors and polarization 
tensors are 
ci = Goan, 
aS = Pk (yo! ads 
; —|] faba 
ue = —(ey;;)*k'C’, (3.9.5) 
2 


When we insert (3.9.5) into (3.9.1), we find the solution for the B and the F 
fields: 


Bt = pt 
Bie — RE, 
7 er ee eee rer (3.9.6) 
F* = (2p*) [(y - XS) + 3:07 SRY], 
F¢ = (pe) 2S 
where 
: Bey SS (3.9.7) 


We can now contract over vertices and propagators to obtain trees. The great 
advantage of this approach is that we can calculate multifermion amplitudes 
with almost the same ease with which we calculate multiboson amplitudes. 
For example, the propagator and the amplitude for the scattering of fermions 
and bosons are given by 

=I 


>i ai,ai + nS4,5¢ + 4 (3.9.8) 


An = (0;ki| V(kx)D +++ V(ky_1) 103 ky), (9.9) 
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where the various vertices can be either fermionic or bosonic. The four-point 
amplitude, for example, is equal to 


JB —4s)P(—3t) 
rl — 4s — 31)’ 


where K is a matrix that gives us the appropriate spin structure for the ampli- 
tude. If we let u represent a spinor and ¢ the polarization of massless fields, 


then 
K (uy, So, us, $4) = — tH PTT U3 )ooyu(ks + ka) San 
+ 49(@ PTT U3 )o4u (ko + k3)vooa, 
K (uy, U2, U3, ua) = —45(2T us) Tug) + 5 t() "uz (gC, 43) (3.9.11) 


4= (3.9.10) 


As we mentioned before, the basic form (3.9.10) for the massless spinor and 
vector scattering amplitudes is the same. The only difference comes in the 
factor K. 


3.10 Summary 


In summary, there are two equivalent versions of the superstring theory. The 
relative advantages and disadvantages of the NS—R and the GS actions are as 
follows: 


(1) The NS—R action is linear and easily quantized, both covariantly and 
in the light cone gauge. Trees and loops for bosons are easily con- 
structed. It is manifestly invariant under two-dimensional supersymmetry. 
However, these great advantages must be balanced against the fact that 10- 
dimensional spacetime supersymmetry is very obscure, to say the least. 
Because it is based on anticommuting vector fields w,,, there are problems 
with spin statistics, which requires a GSO projection. Furthermore, the 
vertex for fermion emission is prohibitively difficult to use, thus putting 
multifermion amplitude calculations beyond practical reach. 

(2) The GS action is manifestly supersymmetric in 10 dimensions at all mass 
levels. It is based on genuine spacetime spinors, not anticommuting vector 
fields. However, the action is highly nonlinear, and a covariant quantization 
does not seem possible at present. Presently, it is only quantizable in the 
light cone gauge. In this noncovariant gauge, all amplitudes, including 
multifermion trees, can be calculated. 


Let us review a few facts about the NS—R theory. The complete NS-R action 
is given by 


=, = 2 
L = Valea X"IpXy — iV" 0° Vath + Kao" OW" Dp, 


+ dW" Xap? p*x,)- (3.10.1) 
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This action is invariant under 
OX bye, 
dy" = —p*e(dyX" — W" xa), 


2D SUSY: . oa 
de, = —2iEp* Xp, 


(3.10.2) 
OXG = Vue; 


as well as Weyl, reparametrization, and local two-dimensional Lorentz invari- 
ance. The advantage of this formalism is that the energy-momentum tensor and 
the supersymmetric current generate an algebra, the super-Virasoro algebra, 
that allows us to explicitly eliminate all ghosts in the theory. We do not have to 
impose these gauge constraints by hand, as in the case of the superconformally 
invariant action. 

Our strategy for quantization will be the same one used for the bosonic 
string. We first isolate the symmetries of the action, extract the algebra from 
these symmetries, then use this algebra to eliminate the ghost states and obtain 
a unitary theory: 


Action + Symmetries + Current > Algebra > Constraints > Unitarity. 


If we eliminate the vierbein and the x field, we get the NS—R theory in the 
superconformal gauge: 


] 2 
L = ~5—(8uXyd°X" — iV" pl day). (3.10.3) 
aS 


Because of the invariance of the original action, which is locally super- 
symmetric, we can construct the following currents and set them equal to 
zero: 


a 50” paw" ,X yp, ; (3.10.4) 
a o 
Ta = 9X" AX, + 0" paPeWu + ZW" Prdahy, — (Trace). (3.10.5) 


If we calculate the moments of these constraints, they form the algebra: 


D 
[Lois Lr] = (m ai N)Lomin Sie rau a m)Om,—n; 
et Lie a) = (4m = PGi 
(G,, Gs} = Less + 5D(P’ — 5)5,-s: 


D 3 
[Ce Ln] — (m a N)L min or ge here 


be Care F,] = (4m = Nn) Fe 
Cee en Hs Dm Om, on: 


(3.10.6) 


As in the bosonic case, it is straightforward to make the transition from path 
integrals to the harmonic oscillator formalism. The Hamiltonian is diagonal 
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in the Fock space of harmonic oscillators, so we can remove all intermediate 


functional integrations. . 
We find that the tachyon vertex function with weight 1 can be written as 


kuw" Vo. (3.10.7) 


However, we find that there are two distinct ways or “pictures” in which we 
can write down the N-point amplitude, called the F; and F) formalisms. The 
rules for forming the N-point functions are 


vericx = V, 
tor = 
oe ees =a (3.10.8) 
tachyon = k,, - b-1/2 |0;k) ak = 7 
vacuum = |0;k) , alk? = 1 
Veittex = \, 
l 
F): propagator = T? (3.10.9) 
ee 
tachyon = vacuum = |0;k) , ak? = 7 


The advantages and disadvantages of the two formalisms are as follows: 


(1) In the F, formalism, manifest cyclic symmetry is much easier to prove. 
However, we must carry along the excess baggage of the vacuum state, 
which decouples completely in the theory. 

(2) In the F, formalism, cyclic symmetry is obscure, but ghost elimination 
is easy to perform. The advantage of the F> picture is that we are now 
working in a smaller Fock space. 


The N-point function is represented as 


Viki, V(kys Yn 
Mey) Ven Yn) _ 19.9), 3.10.10) 
y YN 


A, = f aus (0; 0| 


Although the NS—-R model with GSO projection can be shown to have the 
same number of fermion and boson states, the proof that the theory is actually 
space-time supersymmetric is prohibitively difficult. As a result, we will show 
spacetime supersymmetry by postulating the new GS action, where we have 
genuine space-time spinors. The GS action is 

—1 - =) 
S= ae do dt{./gg”’ Tq - Mg + 2ie"?a,X"(6'T,,8,0' — 6°1',,8g07) 
— 26 6'T"d,6'67T,,0,07}, (3.10.11) 
where 


I! 0, X" 0" ago. eee ye 
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This action is invariant under 
504 = 64, 
5X" = je4reas (3.10.13) 
as well as 
$68 =a Tye, 
5X" = i64T“504, 
5(./gg%*) = —16,/e(P2"'8 3,6! + PO’ K7F9,67). (3.10.14) 
The advantage of this GS formulation is that it is spacetime supersymmetric 
to all levels. However, the price we pay for this is that covariant quantization 
of the theory is notoriously difficult, because it is a coupled system even at the 


free level. Therefore, as we will go to the light cone gauge, where the theory 
becomes a linear theory: 


=i Peeing 
S= = | du(d,X'a°X’ —18p°d,S). (3.10.15) 
Ayr a 


This action is also space-time supersymmetric, with generators given by 
Q* = (2p*)'?55, 
Oy Ss S7 a. (3.10.16) 
n=—00 
The anticommutation relations between these generators are given by 
(2°, 0} = 2p*s”, 
(0%, 0%} = v2yi,p', (3.10.17) 
(0%, O°} =2H8%, | 
where 


1 = De 
H= aE Dena +n52, Sn) + > - (3.10.18) 
Next, we will investigate the conformal field theory, which combines the best 
features of both the NS—R and GS formalisms. 
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CHAPTER 4 


Conformal Field Theory 
and Kac—Moody Algebras 


4.1 Conformal Field Theory 


One of the mysteries of the superstring is the existence of two ways of for- 
mulating the theory: the first is the NS-R model (after the GSO projection) 
with anticommuting vectors, and the second is the GS model with genuine 
anticommuting spinors. Each formulation has its own distinct advantages and 
disadvantages. In this chapter, we will discuss the conformal field theory, which 
allows us to see the dynamic link between the two formalisms. The conformal 
field theory of Friedan and Shenker [1] combines the best features of both 
theories. Conformal field theory allows us to: 


(1) Introduce covariant anticommuting spinor fields based entirely on free 
fields. The GS formalism, on the other hand, is based on complicated 
interacting fields that make covariant quantization exceedingly difficult. 

(2) Construct explicit covariant tree graphs for multifermion scattering. In 

the NS-R formalism, however, this is prohibitively difficult because it is 

necessary to introduce complicated projection operators that extract out 
the ghosts. The conformal field theory replaces these awkward projection 
operators with free Faddeev—Popov ghosts, which are easy to manipulate. 

Interpolate between the GS and NS—R formulations and see their relation- 

ship. It provides the bridge by which we can interpret the results of one 

formulation in terms of the other: 


(3 


— 


GS model, 


conformal field theory <> NS Rondel: 
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(4) Construct the covariant supersymmetry generators. This is impossible in 
the NS-R formalism, and is possible in the GS formalism only in the light 
cone gauge. 

(5) Describe both the NS and R sectors of the theory with the same vacuum, 
rather than using the clumsy formalism of two distinct Hilbert spaces based 
on the vacua |0)xys and |0)g uy. This is accomplished by a process called 
bosonization, i.e., constructing fermions out of bosons in two dimensions. 


There is a small price we must pay, however, for conformal field theory. 
Ghosts and antighosts proliferate quite rapidly in this formalism, especially 
for superstrings, and a strange phenomenon called “picture changing” must be 
introduced. Fortunately, these ghosts and antighosts are free fields, and hence 
easy to manipulate. Furthermore, conformal field theory does not seem to be 
derived from any action, as in the GS and NS-R formalisms. Conformal field 
theory stresses the group-theoretic behavior of the fields, rather than proceeding 
from an action. Therefore, we suspect that there is a higher, yet undiscovered 
first quantized action that exists beyond the GS and NS-R actions. 

(We should stress that conformal field theory is not a field theory in the sense 
of second quantization, i.e., where we start with the formalism of Schwinger, 
Tomonaga, and Feynman. The second quantized field theory of strings will be 
discussed in Part II of this book.) 

The essence of conformal field theory is that it stresses the use of conformal 
invariance alone to calculate correlation functions between different fields [2- 
6]. It is quite remarkable that conformal invariance by itself is sufficient to 
determine almost completely the structure of N-point scattering amplitudes. 
For example, in (2.6.9) we have encountered operators that have the following 
matrix elements: 


(O(Z1)O(Z2)) = f(z — 22), (4.1.1) 


where the function f can be a power or a logarithm. For example, the matrix 
element of tree amplitudes between two bosonic strings is 


(X(w)X(z)) ~ log(w — z), 


while the matrix element of two normal ordered vertices is 


(eit X(w) gikX()y ~(w— zy. 

So far, we have used explicit representations of the fields in order to calculate 
the matrix elements. However, the process can also be reversed. If we knew 
all about a ficld’s group properties, we should be able to calculate its matrix 
elements and even reconstruct the field. 

The essential idea behind conformal field theory is to use the conformal 
properties of the field @ to completely determine all of its matrix elements and 
even reconstruct the field. In conformal field theory, this is accomplished by 
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knowing the short-distance behavior of left-moving or right-moving fields 


(21) P(Z2) ~ mane + less singular terms. (4.1.2) 
a= 2)" 
Thus, in principle, all possible matrix elements can be calculated from the 
conformal properties of the fields themselves. 
In conformal field theory, we also construct a spin field S, that transforms as 
a genuine spinor under space-time Lorentz transformations and has conformal 
weight 2, However, coming from the Faddeev—Popov ghost sector, we find yet 
another field with conformal weight 2, It is the product of these two fields, one 
being a spin field and the other being a ghost field, that allows us to construct 
the complete fermion vertex with the proper weight one. This vertex, although 
it involves exponentials of fields, is defined totally in terms of free fields and 
thus solves the problem of constructing a simple fermion vertex function. 
The way in which this spin field is actually introduced is through the process 
called “bosonization” [7, 8], i.e., creating a fermion out of a boson. It is easy, 
of course, to create a boson out of two fermions. However (in two dimensions) 
we have the option of being able to create fermions out of bosons, which was 
once thought to be impossible. Actually, the hint of how to “bosonize” was 
already given in the previous chapter, when we introduced the vertex function 


V = relhX,, (4.1.3) 
In (2.6.22) we saw that 
Vi V2 = Vi Vjel™ he, (4.1.4) 


Notice that, by choosing various values of the momenta in the exponential, we 
can actually create operators that satisfy the relation 


Vi¥2 = CDV. (4.1.5) 


In other words, we can create a field with fermionic commutation relations 
out of bosonic harmonic oscillators. The key to this construction is the normal 
ordering of oscillator modes in two dimensions. This feature does not carry 
over to four-dimensional field theories. (In hindsight, it is possible to see why 
fermions and bosons are so closely linked in two dimensions but not in higher 
dimensions. For the Lorentz group O(1, 1), which has only one generator, the 
concept called “spin” in one spatial dimension does not have much meaning.) 

This method of bosonization via normal ordering of fields will also be the 
key to constructing the conformal field theory. We will construct the fermion 
spin field S, via normal ordering of exponentials of bosonic fields. From this, 
we will construct the covariant supersymmetry operator. The advantage of this 
is that we can now discuss space-time spinors using one common vacuum 
for both bosons and fermions and that the entire construction is based on free 
fields. 

Before we discuss the superconformal case, let us begin our discussion by 
analyzing conformal field theory. Let us make the most general conformal 
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transformation on the world sheet variable z: 
Z —> Z(z). (4.1.6) 


Under this conformal transformation, we say that a primary analytic function 
transforms with conformal weight h if it transforms as 


mn ff 6 2 
$(z) = (2) (=) : (4.1.7) 


(Secondary fields transform as derivatives of (z).) This conformal weight is 
the same concept we introduced back in (2.7.6) when we were discussing the 
conformal properties of vertices. We see now that mathematically the confor- 
mal weight is an index to label irreducible representations of the conformal 
group generated by the Virasoro algebra. 

We can now construct objects that are invariant under a conformal 
transformation: 


b(z) dz" = $(z) az". (4.1.8) 


Once we have defined how fields transform under a conformal transformation, 
we must then check that we have closure of the group under two such conformal 
operations. Let us say that we make two successive conformal transformations: 


Z—> 2(z) > Z2[z1(z)]. (4.1.9) 
Then the field transforms as 
a dz, : 
Uio(z)U; = o(z1) | —} , (4.1.10) 
dz 
Ce dz) dz, \" | 
U,U (z)U POR = #feateu(an =<) (ally 
dz dz 
Thus, the conformal transformations form a group with the composition law 
U; = UU. 
Z3 = 2Z2(z1(Z)). (4.1.12) 


The closure of the algebra can most easily be seen if we take an infinites- 
imal conformal transformation. Consider an infinitesimal variation of the 
coordinate: 


02 == fi) (4.1.13) 
Under the transformation (4.1.7), we have the infinitesimal transformation 
bo, = [€0 + h(de)]o. (4.1.14) 


(We will abbreviate 0. by 0.) If we commute two such variations, then we 
arrive at 


Stet.e2) = (5e,. Se]; (4.1.15) 
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where 
[é1, £7] = E10€) = £20€}. (4.1.16) 


(Compare this relation with (2.1.30).) 

We have shown once again that the group closes with arbitrary weight h. 
To get a better understanding of the meaning of the conformal weight h, let us 
calculate the conformal weight of the string. Under the infinitesimal variation 
of the coordinate, we use the chain rule to show that the string transforms as 


58X,,(z) = €9X,(z). (4.1.17) 


Thus, the string field was weight 0. Likewise, it is easy to show that the 
derivative of the string field has weight 1: 


60X,,(z) = [€0 + (0€)]0X,,(z). (4.1.18) 
Let us summarize the weights of some common string fields: 


Field Weight 


x 0 (4.1.19) 
ax I 
exwex 9 


In particular, this means that the energy-momentum tensor or the Virasoro 
generators have weight 2. 

Weights are additive. The product of two fields of weights h, and hy produces 
a field of weight h; + h2 at the same point: 


GO '(Z)GO"(Z) = GOW), (4.1.20) 
Notice the important fact, which will be used throughout this book, that the 
integral of an object of weight 1 is an invariant: 


5 f $(z) dz = flva + (dv)p] dz 


= f d[ve] dz 
=0. (4.1.21) 


We will use this fact in constructing vertex operators and also the action of the 


second quantized theory. 
Now let us investigate in detail how the energy-momentum tensor acts on 


the basic fields of the theory. 
If the action is written in terms of X,,(z, Z), 


c= = [ @2dx"ox,, (4.1.22) 


then the energy-momentum tensor in (1.9.17) associated with this action is 
Tp = —50X"0X,,. (4.1.23) 
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The transformation of fields under conformal transformations is generated 
by the integrated energy-momentum tensor parametrized by a small function 
gE: 


il 
T. =—— D dze(z)T(z), (4.1.24) 
2i iGa 


where we take the line integral that encircles the origin in z-space. The im- 
portance of the energy-momentum tensor 7 is that it is the generator of the 
conformal transformations we have been studying. To see this, write 


5b(22) = (Te, b(22)] 


= 5 f dze(z)T(z), 629] 


] 
ani Co.2—Co 


l 
=e f Pe(z)T (z)b(z2) 


dz Pelz) T (Qo) 


= aaf dze(z)T (z)P(z2). (4475) 
2mii Jc, 


Notice that the curve Co, encircles both the origin and the point z>, whereas 
the curve C> encircles only the point z2, because 


Cre Cys Cn (4.1.26) 


(see Fig. 4.1). Notice also that we have adopted radial ordering P of our 
operators in the complex plane, where the operators are ordered according to 
their distance from the origin: 


[Zz lzi}. (4.1.27) 


In the last step, notice that we have dropped radial ordering, because the radial- 
ordered products are analytic. 
From before, we also know that 


8d = [cd + A(dE)|¢. (4.1.28) 


By comparing the two expressions (4.1.25) and (4.1.28) for 6¢, we can now 
read off the small-distance behavior of the field ¢: 


= h e ] 
T(z)o(w, w) ~ —— o(w, w) + ——0,@+-::. (4.1.29) 
(aw) cw) 
Since the energy momentum tensor itself has weight 2. we can also insert it 
into the short-distance equation. Thus, we derive the short-distance behavior 
of the generators themselves: 


l 
Fey 5 T(w)+ ——9,.T(w)+---. (4.1.30) 


1 
2(<— wy . (z - w) (z—w) 
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FIGURE 4.1. Contours for conformal field theory integrations. In the first diagram, 
the point z lies between two concentric circles. In the second diagram, the direction 
of the inner contour has been reversed and merged with the outer contour, forming 
a closed path. The point z lies within the closed path, but the origin does not. 


The second term on the right-hand side of the equation shows that the generators 
themselves transform as weight 2 conformal fields. 

Let us now introduce normal modes and make the link with the oscillator 
formalism. We can always decompose any field y of weight h as follows: 


NO) aaa | (4.1.31) 


n=—CO 


This allows us to decompose the energy-momentum into normal modes. We 


write 
dz 
ee f T(z) 
20 i 
[oe L,. (4.1.32) 


It will be instructive to actually derive, step by step, the Virasoro algebra from 
these abstract expressions to see the equivalence of commutators and operator 
product expansions. Using (4.1.30) and (4.1.32), we find 


d d 
pom (br = f ee wit! f Oe oth 


21i 270i 
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tc i Z T(w) + I 77 
——— LT —— Ww —— 
“| @— wt " @— up (@—w)" 
= oe { + 1)w"2T(w) + w"*" 3,7 
Il 


a 
+ aaa (n + 1)n(n — 1) 


= f “ |» + 2)w""*!T(w) — (m+n + 2)w"""|'T(w) 
age! 


+ nin + 1)\(n - ponent 


This finally reduces down to the usual Virasoro algebra: 
[Ems Lal = (mt = Linen + (0? — M)Sn.on- (4.1.33) 


Inserting the expression for L,, into the line integral, we also obtain 
[Ln, O(z)] = {2"*18 + A(n + 1)z"}O(2). (4.1.34) 


(This can also be derived from (4.1.28).) 

We have shown that this description is equivalent to the usual one in terms of 
commutators that we studied in previous chapters. In fact, the previous equation 
is nothing but (2.7.6), which we used to calculate the conformal weight of a 
vertex function. 

Not only can we express the usual conformal commutators in this fashion, 
we can also express the Faddeev—Popov ghost contribution in the conformal 
gauge in this equivalent language. The Faddeev-Popov ghosts will give us a 
different representation of the Virasoro algebra based on b and c ghost fields 
rather than the string variable X. If we write (2.4.1), we have 


bg-- ame V.6E:, 
Ogzz = V:de:. (4.1.35) 


These constraints, tn turn, allow us to calculate the change in the functional 
measure. A simple calculation of the Jacobian of the transformation yields 


Digw Dgz = (det V-)(det V:)De- De:. (4.1.36) 


“ 


To exponentiate the determinant into the action, let us introduce two ghost 
fields, b and c. Then the ghost action (2.4.4) can be written as 


] - ‘on -_= 
Seh = a [ @xo.8c =F b:20-C*). (4.1.37) 


(Note that we have written the ghost action such that the tensorial nature of 
the fields is manifest under the conformal group. In this way, the invariance of 
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the action under conformal transformations is transparent. Thus, the field b-. 
is a second-rank tensor, while c* is a first-rank tensor.) 

Given this action, we can construct the energy-momentum tensor associated 
with it: 


Teh = €9,b + 2(8,c)b. (4.1.38) 


These ghost fields have the following matrix element: 


ecu. 
7 VN} 


If we now carefully calculate the short-distance behavior of these fields, we 
find 
AC. Oa (ere 4.1.39 

ee = Cee yy Be ea wale) 
Notice the factor of —13 in the anomaly calculation for the ghost energy— 
momentum tensor. If we take the sum of these two energy—momentum tensors, 
(4.1.23) and (4.1.38), we find that the combined tensor has a vanishing anomaly 
in 26 dimensions: 


T(z) = Tx(Z) + Ten(Z), 
| D—26 


T(z)T(w) ~ I@— wy 


(4.1.40) 
Notice that the sum of the two energy-momentum tensors has zero central 
term only in 26 dimensions. This, in fact, fixes the dimension of space-time 
to be 26. To summarize, the bosonic representation of the energy-momentum 
tensor in terms of string fields X yields D for the anomaly term, while the 
ghost representation of T yields —26. The sum of these two tensors is the true 
energy—fnomentum tensor, which has vanishing central term only if D = 26. 

Whenever we have a Lie group, the first question to ask is: What are its 
representations? Let us now say a few words about the representations of the 
Virasoro algebra. 

For SU(2) or SU(3), for example, we know that the representations can be 
formed by using the familiar “ladder operators” to construct triplets, octets, 
and higher representations. For a general Lie algebra, we can construct rep- 
resentations by taking all possible products of “raising operators” acting on 
a “highest weight” vacuum. The set of all such states created by the raising 
operators is called the “universal enveloping algebra.” The same process can 
be used for the Virasoro algebra, treating L_,, as ladder or raising operators. 

We will define the highest weight vector |h) as follows: 


Lo |h) =h|h), 
L, |h) =0, wheren > 0. (4.1.41) 
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Notice that any physical state |R) with h = | is a highest weight vector, since 
it satisfies (Lo — 1)|R) = 0. Notice that the vacuum state |0; k) is also a highest 
weight vector, but that |0; 0) is not. 

Now define the set of states generated by all “raising” operators L_, acting 
on the highest weight vector: 


lo) = DM LM... D4 Ih). (4.1.42) 


—Nn2 —NnN 


The set ofall such states is called a Verma module. If |h) is considered a physical 
state, then the Verma module associated with it is the set of all spurious states 
that we can generate from that real state. 

Notice that, under an arbitrary conformal transformation, this state simply 
transforms into another member of the Verma module: 


Q|w) = |w’), (4.1.43) 
where 
Q = ern C bn. (4.1.44) 


This is because a Virasoro generator that hits an element of a Verma module 
simply creates another state within the same Verma module. The Verma module 
therefore forms a representation of the group. 

In addition to the highest weight vector, let us also define the SL(2. R) 
invariant vacuum, which we introduced earlier: 


£4; |0;0) = 0, 
Eo i020) =.0; (4.1.45) 
The relationship between these two different types of vacua is 
|h) = 0(0) |0; 0), (4.1.46) 


where (0) is a primary field of weight /. This is easy to check if we hit both 
sides of the equation with the Virasoro generators. 


4.2 Superconformal Field Theory 


Now let us discuss the more complex question of writing the supereonformal 
field theory. Instead of writing down the transformation of a field as a function 
of a complex variable, we write down the most arbitrary transformation of a 
pair of variables: 


Z=2.0), (4.2.1) 


where # is a Grassmann variable. The most arbitrary transformation of this 
pair is given by 


2(z) = (2(z, 0), A(z, )). (4.2.2) 


; 
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It is disappointing that this transformation is much too general. In fact, it 
doesn’t close properly if we take the product of two such transformations. 
We need to impose a constraint on the system in order to make the group of 
transformations close properly. 

We define the supersymmetric derivative as 


= wan 4.2.3 
mis ee (4.2.3) 
so that 
a 
D=—. (4.2.4) 
dz 


Now let us calculate how this supersymmetric derivative transforms under a 
reparametrization: 


D(D6)D + [Dz — 6D6]D". (4.2.5) 


This composition law is also disappointing because it is highly nonlinear. We 
want, as in the conformal case, a transformation of the derivative to be linear. 
Therefore, we simply eliminate the nonlinear terms by imposing the constraints 


D = (D6)D, 
Dz — 6D6 = 0. . (4.2.6) 
These constraints, which linearize the transformation of the reparametrization, 


are precisely the constraints we want. By imposing them, we can show that 
two distinct superconformal transformations close properly: 


Z>i >i. (4.2.7) 


In summary, we say a field transformation is superconformal if it satisfies 


dz - c (4.2.8) 
(2) = g(@)(DE)". 
It is easy to check that if we eliminate the 9 components, this expression 


reduces back to the conformal equation written earlier in (4.1.7). 
As in the previous section, we can write this down as an infinitesimal 


variation on the fields: 
5.@ = [ed + 4(De)D + h(de)]¢, (4.2.9) 


where € parametrizes the superconformal transformations. Notice that this 
expression is exactly identical to the bosonic transformation (4.1.28) except 
for the term that has two D’s in it. The closure of the algebra can now be 
written 


Stee] = [de,, be,]) (4.2.10) 
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where 
[e1, €2] = €10€2 — €20€; + 3(De1)(De2). (4.2.11) 


Notice that we reduce back to the usual Virasoro case (4.1.16) when we set 


De, and De» equal to zero. 
We find that the Grassmann parametrization can be written in terms of two 


parameters, 0z = &, and 00 = &>: 
&(z) = 6z +660 = ©, +06), 
bz = € — 050 = & + 468), (4.2.12) 
io De = $(€, + 966€,). 


The previous equation is a specific solution of (4.2.6), so we know that the 
group closes properly when we make successive transformations on the fields. 
This solution will be the basis on which we will construct the superconformal 
currents and then the superconformal algebra itself. 

So far, we have made only very general remarks about the superconformal 
group without referring to any specific model. Let us now rewrite the NS-R 
model in terms of conformal fields. The action is 


] a 
c= = / d*zd0 d0 DX" DX,,. (4.2.13) 
The equations of motion are 
DDX, =0 (4.2.14) 
whose solution is 
X#(z, 0, Z, 0) = X*#(z, 0) + X*(Z, 6). (4.2.15) 
Thus we choose 
X(Z.0) = X*@)eae yz) (4.2.16) 
so that the action can be written as 
1 cae 3 ee 
sS= == f @adxrax, — wow — wow). (4.2.17) 


Notice that this is nothing but the original NS—R (3.2.1) action written in confor- 
mal language. From this action, we can read off the super-energy-momentum 
tensor with a slightly different normalization 


T = Tr +OTg=—; Diao. (4.2.18) 
Written out explicitly, this is 

Tr = —3W,aX", 

Tz = —30X"0X, — dv" Yy,. (4.2.19) 


This ts nothing but the currents (3.2.7) and (3.2.13) written in conformal lan- 
guage. Note that the supercurrent J, is now written as part of the same tensor as 
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the energy-momentum tensor T,,,, which means that they are supersymmetric 
partners of each other. 
As in (4.1.25), the variation of a superfield is given by 


| 
$2) = 5 f dzje(z1)9(20). (4.2.20) 


If the field @ has weight h and the energy-momentum tensor has weight 3, 
then ¢ and T transform under a superconformal transformation as : 


T(z )(Z2) ~ he) + - sec a ae 2— + ap Ses 
12 


T(z)T (22) ~ ss += 2 by T (Z) (4221) 


aah 
ot 6 

PD yihay e =C 
2 Z12 £12 


where 


O12 = 8; — &, 
Z2 = 21-21 — Oh. (42522) 


Actually, these relations are a deceptively compact way of expressing a large 
number of equations. To see this, let us expand out the previous expression. 
Inserting (4.2.18) into (4.2.21), we find 

3¢/4 2 
Cay aa eee = oe 

: 
(21 — 2)? 
- 2/4 ; 
Tr@i)Tr(@2) ~ —-—, + — 

ieecun 21 — 2 
These relations, when expanded out, yield the superconformal algebra that we 
wrote out explicitly in (3.2.26) and (3.2.28). Thus, these equations contain a 


considerable amount of information. 
The field transformation in (4.2.21) can be written out in detail if @ = 


go + Of, 


1 
Tez) Tel) ~ ey oR OoTp t+-::, 


= SW oie (4.2.23) 


pc) ee co) ae 


Tp (Z2) + Para) 
2 


1 
Tp(Z1)bo(Z2) ~ ao vane a aes go +--:, 
h + 5 
Tp(z1)O1(Z2) ~ a oy a ee 
Tr(Z1)o(22) ~ ae Pilz) Es (4.2.24) 
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i 


Tr(Z1)b1(Z2) ~ = ———j ¢(zZ2) sae = bo cone 
If we power expand as 
Tr(z) = ; i Ga 
Es(2) = ele (4.2.25) 


Go(2) = Doz" "don 
Oi) = ome” ear, 


n 


then we obtain the transformation of g(z,6) under a superconformal 
transformation: 


[Ls Po(Z)] = 2"! Ago + h(m + 1)z”ho(z), 
[Ls P1(2)] = 24! 0G) + (A + §)(m + 1)z"Gr(2), 
[€Gn, Po(z)] = €2"*""Gi(2), 
[EG Pi (Z)] = Elz" 4/7 OG + 2m + 5 )hz"—/Go(z)], 
[Lm, Go.n] = (A — 1)m — n}ho.m+n: (4.2.26) 
[Eis Pind = [Ch = 3) — aime 
[EGmn, 0.n] = EQi mtn; 
[EGm, Pin] = e[(2h — 1)m — 1] h0.m4n- (4.2.27) 
Thus, relations (4.2.21) contain the entire information concerning supercon- 
formal transformations. 

Now let us calculate the contribution to the anomaly. Comparing (4.1.29) 
and (4.2.23), we see that a boson X,, contributes D to the anomaly, whereas a 
fermion y,, contributes only {D. We can also show that the contributions of 
the b, c ghosts and f, y ghosts yield factors of —26 and +11 for the anomaly. 


Later in this chapter, in fact, we shall show that the anomaly contribution by a 
ghost of conformal weight A is 


= —26(6A(A — 1) 4+ 1), (4.2.28) 
where € equals +1 (—1) for Fermi (Bose) statistics. Summarizing this, we find 


Field Anomaly 


an D 
Wu iD (4.2.29) 
b.c 6 
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Adding these together, we find that the total anomaly contribution is 
D+5D—26+11=3(D— 10) (4.2.30) 


so D = 10 in order to cancel this term. 


4.3 Spin Fields 


So far, almost nothing is new. We have only rederived old results that could 
have been obtained with the harmonic oscillator formalism of the previous 
chapters. We have simply chosen to rewrite the generators of the NS—R algebra 
in a language that stresses the conformal weight and the z-plane singularity 
structure of the commutators, rather than expanding them out in terms of their 
Fourier moments. What is new, however, will be the introduction of a new 
field S, that transforms as a genuine spinor under the Lorentz group. This will 
allow us to rewrite the NS—R model with only one vacuum, rather than two, 
by the process called bosonization. 

Let us write down the generators j“" of the 10-dimensional Lorentz group 
SO(10) in terms of the anticommuting vector field y found in the NS—R theory: 


JO) = vr '(z). (4.3.1) 


We demand that they have the commutation relations of the usual Lorentz 
algebra, but reexpressed in terms of fields defined in the z-plane. Given the 
known anticommutation relations of the w field given in (3.2.23), we can 
readily show 


nh? 7" (w) — w <> vl —o < 7) 


JP)?" (w) ~ 


ye TY 


4+ ————(n** nn" —worv)ter. (4.3.2) 
(z — wy 


Notice that the generators of this algebra are functions of z and hence form a 
larger algebra than that of SO(10). In fact, this algebra is called a Kac~Moody 
algebra and we will use such algebras extensively in this book. 

When studying the group theory behind Lorentz symmetry, we know that the 
representations of the group can be grouped into tensors and spinors. The trans- 
formation properties of these fields can be uniquely determined from group 
theory alone. Similarly, we will define tensor and spinor representations of the 
Kac--Moody algebra associated with SO(10), whose transformation properties 
‘are uniquely determined by group theory alone. These transformation proper- 
ties, it turns out, are so powerful that we can determine the matrix clements 
from them. 
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In particular, a vector, by definition, transforms under the SO(10) Kac— 
Moody algebra as follows: 
1 
+ ULV a ae 
SMC) 8) oe 
A spin field S,, because it transforms as a spinor under SO(10), must satisfy, 
by definition, the following transformation property: 


Cs aaah es a) (is eect (4.3.3) 


‘oa | 
j*" (2) Saw) ~ 4 ——— (vy "Ne Sp(w) + + one 
42 —W 


The remarkable claim of conformal field theory is that the two previous 
identities, which show how vectors and spinors transform under SO(10), are 
sufficient to determine practically all of the correlation functions of the theory! 

The energy—momentum tensor, in turn, can be written as the normal-ordered 
square of the Lorentz generators: 


TY = 5 - Uo ay: (4.3.5) 
Explicitly calculating the commutators of the previous equation verifies that it 
is the Virasoro generator. 

Unfortunately, the spin field S,, has conformal weight 2, which we will prove 
later in this chapter when we construct an explicit representation of these fields. 
We can see this intuitively, however, for the following reason. We saw earlier 
in (4.1.46) that the highest weight vector |/1) of a Verma module can be written 
as 


|h) = (0) |0; 0) , (4.3.6) 


where ¢ is a field of weight / and |0;0) is the SL(2. R) vacuum. For the 
superconformal case, we actually have two highest weight vacuum vectors: 


|h) = S(O) |0; 0), 
Go |h). (4.3.7) 


We wish to eliminate the second highest weight vacuum vector in order to 
preserve two-dimensional supersymmetry. In the superstring, we know that 
we only have one spinor in our theory at the lowest level, not two. In order to 
eliminate the second state, use the identity 


A 


e 
ie: 
(We are taking a slightly different form for the c-number anomaly in the Ra- 
mond algebra in (3.2.28). This choice also satisfies the Jacobi identities.) If 


we hit the state Go |/) with another Go, then we want the state to vanish. This 
means 


Oi (4.3.8) 


a 


Cohn (: zs <) ih) =0. (4.3.9) 
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To satisfy this, the weight h of the spin field must be 2 = 2 , as expected. 

The key to calculating the matrix elements of any conformal field is to 
calculate its short-distance behavior with the superconformal group and other 
fields. Therefore let us now calculate the short-distance behavior of the spin 
field, using invariance arguments alone to determine its short-distance structure 
and its matrix elements. 

First of all, the previous identity (4.3.4) tells us that the operator product of 
two S’s must contain at least a w field: 


l 
Sy(z)S?(w) = aye Ya ule uw) +otherterms. (4.3.10) 

To calculate the operator product of w and S, we consider the three-point 
function 


(Olns Sa(Z1)"(Z2)Sp(Z3) |0)ns (4.3.11) 


where the vacuum is the NS vacuum. In the limit of z} —~ oo and z; > 0, 
the spin field changes the NS vacuum into a vacuum with spinor quantum 
numbers, i.e., the R vacuum |0)p: 


Sa(0) |0)ns =10)R as (Ol s Sa(00) = Ue (Olp (4.3.12) 


(Notice that the spin operator allows us to go from the NS vacuum to the R 
vacuum, which was not possible in the earlier NS—R theory.) This means that 
(4.3.11) can be written as 


Ug (Ola W"(Z2) |0)R up. (4.3.13) 


But only the zero modes of the w field survive this vacuum expectation product, 
so we are left with the orien element of a Dirac matrix: 


W"(z)Sa(w) ~ a aya Na S62) + (4.3.14) 


This, in turn, means that y occurs in the operator product of two S’s: 


Sa(z)S?’(w) ~ Aya (4.3.15) 


1 
Vie — wy 
Finally, we need to know the short-distance behavior between two spin fields. 
We saw that the conformal weight of the spin field is 2. Thus, 


Sa(z)S?(w) ~ —68(z — wy 4 tees, (4.3.16) 


where 3 is twice the dimension of the spin field. 

Finally, we can bring (4.3.10), (4.3.14), and (4.3.16) together, which yields 
the short-distance behavior of two spin fields. In summary, invariance argu- 
ments have established the short-distance behavior of the spin field, which can 

be represented as 


Sa(z)S8(w) ~ ———— (wh w"(z) 


= 
eauyh* aga 
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1 
Say Pm a) 
/2(z — w)'/4 ae 
We saw that the spin field has dimension 2, What we want is a fermion 
vertex of dimension 1, suitable for use in a multifermion amplitude. To find 
the missing factor of ;, let us now turn to the ghost sector of the theory. The 
ghost sector will give us the final missing piece. 


4.4 Superconformal Ghosts 


Using conformal language, let us now rewrite the Faddeev—Popov determinant 
arising from the superconformal gauge fixing. Using (3.4.5), we find 


Ogege= V-0ba, 
8x. = V.68, (4.4.1) 
and their complex conjugates. Thus, the Faddeev—Popov determinant is 
det. (veo)idet, (v2), (4.4.2) 


and their complex conjugates. At first, this looks very much like the determinant 
we found in (2.4.3) for the Nambu—Goto string, 1.e., the determinant of d. and 
0,. There are, however, several important differences. The Hilbert space over 
which these operators act has now changed. If we expand the determinant 
in terms of basis states, we find that the conformal weights have changed, 
the tensor transformations under the conformal group have changed. and the 
Statistics of the fields is reversed. 

When we exponentiate these determinants into the action (see (1.6.16)), we 
arrive at the ghost action: 


l we 
Sh = = faz d@ dagBDC + c.c., (4.4.3) 
where 


Cae (4.4.4) 


C(z) = c(z) + 6y(z), 


where f and y are commuting operators. Performing the integration over 0, 
we have 


l ee 
Soh — s | @05c qr Boy) ap (Xe. (4.4.5) 
DB =0, 

DC =0, (4.4.6) 


where (b, c) are the ghost fields arising from gauge fixing the metric, and the 
B, y are ghosts arising from fixing the xq field (cither in the NS or the R sector). 
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The superconformal ghosts (which have commutation relations among them) 
were first encountered in (3.5.14) when we quantized the NS-R model. The 
difference now is that we wish to stress the conformal properties of these fields, 
i.e., their weights. 

If we summarize the weights of these fields, we find 


Field Weight Statistics 


b 2 Fermi 

EG —| Fermi (4.4.7) 
B ; Bose 

y ; Bose 


Given the action (4.4.5), we can extract out the energy-momentum tensor, 
which is given by the sum of two pieces: 


Tx = —}DX"0X, = Tr + OTs, 


; : (4.4.8) 
Tyh = —COB + 4 DCDB — 30CB. 
Written out explicitly, we have 
T?"(z) = —cdB — 3acB + hyb, 
Ts" (z) = cdb + 2dcb — 37 0B — 38yB. (4.4.9) 


It is awkward always having to write down b and c and B and y, especially 
when the equations for the bosonic and fermionic fields are so similar. We will, 
therefore, adopt a system where we can describe all ghosts fields at once, with 
arbitrary weights. Let us write down the generic ghost action, using boldface 
fields to represent either the commuting or anticommuting ghosts: 


1 = 
sS=— / d’zbac + c.c., (4.4.10) 
I 
db = dc = 0, (4.4.11) 


where we define the field b to have arbitrary weight A and the field ¢ to have 
weight 1 — A. Remember that the anticommuting b, c ghost system has A = 2 
and that the commuting f, y ghost system has weight A = 2. This action is a 
compact expression for (4.4.5), except now we have generalized our discussion 


to all possible conformal weights. 
Starting from this generic action, we can easily write down the energy— 


momentum tensor: 
T(z) = —Abde 4+ (1 — A)dbe. (4.4.12) 
, If we take the decompositions: 


b= Se a, 


néd—A+Z 
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c(2) =." So aeaeaaai ee (4.4.13) 


nédta+Z 
where 5 equals 0 for the NS sector and ; for the R sector. With this 
representation, the Virasoro generators are 


Lit =D (k — (1 — A)n)bn—aee- (4.4.14) 
k 


We can check that these generate the usual commutation relations for 
arbitrary 2. 

In addition to the energy-momentum tensor, we can construct two other 
currents from the action, the BRST current and the ghost number current. 

Following (1.9.12), the BRST current is due to the fact that the original 
gauge action, plus its Faddeev—Popov ghosts, has a residual gauge symmetry 
that is nilpotent (and hence cannot be used to eliminate any more fields). Any 
gauge symmetry has a current associated with it, so the BRST current can be 
derived directly from the action: 


Jarst(z) = DC(CDB — 3 DCB). (4.4.15) 
Following (1.9.12), the BRST charge is the superintegral of the BRST current: 


1 
QOzrst = = f dzd6 Jans (4.4.16) 
Ti 


This charge can be divided into three pieces: 
Osrst = 8 + 9 + Q™, (4.4.17) 


where 


Qo = f dz(cTs(X, vB, y) — e8eb), 


I 
Q”) = En f dzhy Wy0x", (4.4.18) 


QQ?) = a, 1-1 v7] 
ai lo 


By carefully analyzing cach piece, we can show that the sum is nilpotent: 
Onst = 0. (4.4.19) 


In addition to the BRST current, there is also the U(1) current, called the 
“ghost number current.” Naively, because the ghosts always occur in pairs in the 
action, we expect that they must conserve a certain quantum number, somewhat 
analogous to the situation with baryon number. However, surprisingly enough, 
we find that there is a correction term to the ghost current. Let us write down 
the ghost number current, which is just bilinear in the ghost fields: 


i@==—be= De (4.4.20) 


n 
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where 


= a (4.4.21) 
k 


€ = +1 (Fermi) or ¢ = —1 (Bose). We have all the identities necessary 
to calculate the short-distance behavior between the ghost current and the 
energy—momentum tensor: 
. Q j(z) 
T(z)j(w) ~ —— + ——— + - s—" 4.4.22 
MOS ame (4.4.22) 
where Q = e(1 —2A). (Notice that the presence of this Q factor is anomalous.) 
This implies 


[Ls jn] = —Njm+n + 4 Qm(m + 1)bm,—n- (4.4.23) 


This ghost number current assigns a ghost number to each of the ghost fields. 
This ghost number current has unusual properties under complex conjugation: 


ih, = —j-m — Q5mo, (4.4.24) 
where 
Q = e(1 — 2d), 
Pa $(1 —€éQ), (4.4.25) 
€ = +1 (Fermi statistics); — | (Bose statistics). 


(If the Q were missing in (4.4.24), it would have normal complex conjugation 
properties.) The quantum numbers of the ghost fields are - 

Deo is = le e, OS=3, C= —26, 

Oy eo =—la—=, O= 2) | c= 11) 
and the anomaly contribution of the ghost is c = —2e(6A(A — 1) + 1). One 
of the unusual features of this ghost structure is the existence of an infinite 
number of vacua, which arises because of (4.4.23) and (4.4.24). Let us define 
the vacua by 


(4.4.26) 


Db, ig) — 0; n>eéq-i, 
C10) 0; n>-—éqt . (4.4.27) 


The zeroth component of the ghost number current and the Lo have the 
following action on these states: 


jolg) = 14). 
Li \q) = $eq(0 +9) 19). (4.4.28) 


In particular, the last identities show that the only nonvanishing matrix elements 
are 


(-—q — Qlq) =1. (4.4.29) 


The easiest way to show this is take the matrix elements of the currents between 
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different vacua, where each vacuum is labeled by the number q: 


(q'|Lo |q) » (q'| jo 14) - (4.4.30) 


All the matrix elements are zero, except when gq’ = —g — Q. 

This is highly unusual. In the usual Veneziano model, there was only one 
unique vacuum. Now, there appears to be an infinite number of vacua labeled 
by g for the ghost sector of the NS and R models! The existence of an infinite 
number of Fermi and Bose vacua is one of the unusual features of conformal 
field theory. 

This means that there is an anomaly in the ghost current. The problem lies 
in (4.4.22) and (4.4.23), i.e., the fact that the U(1) ghost current has anomalous 
commutation relations with the energy-momentum tensor. The anomalous 
term corresponds to the violation of the conservation of the ghost number. 
In fact, the divergence of the current is given by 0: j- = 1 efi 1,@..,/0hte 
two-dimensional curvature density. The ultimate origin of all these difficulties 
actually lies back in (4.1.36), when we calculated the Faddeev—Popov deter- 
minant over V-. and V-. A careful analysis of the eigenvalues of these operators 
shows that we have to remove the zero modes, or else the determinants make no 
sense. When we exponentiate these determinants by expressing them in terms 
of Faddeev—Popov ghosts, these zero modes, in turn, correspond to nontrivial 
solutions to the equations 0:c* = 0:b.. = 0. We cannot give a full discussion 
of these zero modes, unfortunately, because it requires new information that 
will not be discussed until Chapter 8, when we analyze anomalies in detail. 
(Briefly, this anomaly can be shown to be related to the topology of the Rie- 
mann surface swept out by the string. By integrating the divergence equation 
for the U(1) ghost current, we can use the Gauss—Bonnet theorem to show that 
f{ d?z/gR = —87(g — 1), where g is the number of holes or handles in 
a closed Riemann surface. Then we use the Riemann—Roch theorem, which 
tells us that the number of zero modes of c minus the zero modes of / equals 
(1 — 2A)(1 — g). The zero modes of ¢ correspond to conformal Killing vectors, 
and the zero modes of b correspond to moduli. This is a useful result, because 
it tells us that the number of complex moduli for a sphere with ¢ handles is 
3g — 3, which we will use extensively in Chapter 5. It also tells us that the 
number of supermoduli is 4(g — 1), which is difficult to prove using other 
methods.) 

It can be shown that the Fermi vacua (for the ghosts of the usual Nambu—Goto 
bosonic string) are actually equivalent vacua; i.e., by multiplying Fermi vacua 
with various monomials of b and c, we can reproduce the other vacua. Thus, 
the various vacua yield equivalent representations. However, the situation with 
the Bose vacua (for the ghosts of the NS-R model) is different. We find that 
the various vacua are actually inequivalent vacua. No monomial in the fields 
B and y can take us from one vacuum to another. 

For the NS sector, the Bose vacua are labeled by integers, while for the 
R sector the Bose vacua are labeled by half-integers. The vacuum states that 
come closest to the usual definition of the vacua (i.e., they are annihilated by 
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all positive frequency parts of the oscillators) are 


-4). 
R | 
NS: {|-1). (4.4.31) 
They are normalized as follows: 
3 {-H=1, 
(-I]-1) =1. (4.4.32) 


Although the appearance of an infinite number of inequivalent vacua appears at 
first to be a disaster, we will show later on that this yields a perfectly acceptable 
theory. In particular, we will show that, on-shell, it really makes no difference 
which of the various vacua we choose. All on-shell matrix elements for the 
same physical process, with arbitrary choices of vacua, will yield the same 
numbers. In fact, we will be able to create “picture changing” operators that 
will take us from one vacuum to another. The situation is exactly analogous 
to the one found earlier with the F; and F) pictures discussed in (3.3.18) and 
(B29). 

Having worked out the structure of the ghost sector of the NS—R model, our 
next step is to find the field that gives us conformal weight 3, the missing piece 
in the fermion vertex function. 

Let us now bosonize the ghost current by introducing a new field @ into the 
theory: 


j(z) = ed¢(z). (4.4.33) 
The new object we wish to investigate is 
rene, (4.4.34) 


Its short-distance behavior is given by 


je ~ 1H) 4 ..., (4.4.35) 
mgt) 


and 

T (z)et*™) ~ [Leq(q + Oz — wy? + (z — w) "0, Je*?™ +++. (4.4.36) 

This means that 
79) 10) = Iq), (4.4.37) 
weight: seq(q ta) (4.4.38) 
This means that multiplying by the bosonized field e’? allows us to go from 
one ghost vacuum to another. Notice that the NS ghost vacua are integral, while 
the R ghost vacua are fractional. Because q can be fractional, this allows us 


to go back and forth between the various NS and R vacua by multiplying with 
ef? 
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Let us use this bosonization technique to write the anticommuting b and c 
fields in terms of a new scalar boson field o: 


Pen, 
e(z) = e™, (4.4.39) 


(We can check that the fields have the correct conformal weight. From (4.4.38), 
a field ce? has conformal weight $qg(q — 3). Then for g = —1, the field e~” 
has weight 2, while for g = | the field e” has weight —1. They therefore have 
the correct weight.) Notice that both the left and right sides are anticommuting 
fields, even though o itself is a commuting field. It is now easy to show: 


| 


—o(z) g0(w) aS : 
Li) 


(o(z)o(w)) = log(z—w), 


For the NS—R ghost sector, however, this is more subtle. These ghosts are 
already commuting, so bosonization does not seem possible. We can, however, 
use a trick: 


p=e %8é, 
Veen: (4.4.40) 


where the left-hand sides are commuting fields, and the right-hand side is the 
product of two anticommuting fields; i.e., § and 7 are also anticommuting 
fields (which, in turn, can be bosonized). Thus, we have written commuting 
fields in terms of anticommuting fields. We can also reverse this procedure: 


p= Over. 
a& = dBe®. (4.4.41) 
Notice that the € and n fields are themselves anticommuting, so we can 
bosonize them. Let us express these two anticommuting fields in terms of the 
boson field x: 

ease", 
n=e%, (4.4.42) 
Although conformal field theory has the great advantage that all fields are free 
fields, one small price to pay is that we must keep track of all these different 


free fields. Because it is crucial to keep all these boson fields clearly defined, 
let us tabulate their quantum numbers: 


Boson Charge Anomaly Weight 
@ Q=2 c=13 wt(expqg¢) = —34(q +2) (4.4.43) 


xX Q=-1 c=-2 _— wt(expqx) =ig(q-1) 
o Q=-3 c= -26 wt(expqo) = iq(q —3) 


There is a crucial subtlety in the definition of the Bose NS—R ghosts. Notice 
that 6 is defined in terms of the derivative of &, so that it is independent of 
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the zero mode of the & field. Thus, the usual Fock space is independent of the 
zero mode of the & field. Thus, we have two possible Fock spaces. The “small” 
Fock space is missing the zero mode of &. The “large” Fock space contains the 
zero mode of & and is reducible. Because 


[no, 0] = 1, (4.4.44) 


this means that the vacuum of the 7, € system is degenerate. 
The purpose of this construction is that we can now write down the missing 
piece of the fermion vertex operator. 


4.5 Fermion Vertex 


Let us calculate the conformal weights of the bosonized fields: 
we(e OM) = Feq(q + Q) =F, 
wt(e"/*) = Leq(q + Q) = —3. (4.5.1) 


We now have found the missing piece. Since the bosonized field e~"'/?* has 
weight ;, we can now construct the true fermion vertex of the theory: 


Vai en Ss, (4.5.2) 


which has conformal weight 


$43 +k’, . (4.5.3) 
If we place the external fermion line on-shell 
0, 

VY kay = 0, (4.5.4) 


then we have a vertex of conformal weight 1. This vertex has the correct 
properties under the BRST charge: 


{Qsrst, V-1/2}4+ = 9 (4.5.5) 


up to terms that vanish on-shell. This is one of the great accomplishments of 
conformal field theory, the creation of a genuine fermion vertex function with 
conformal weight | based entirely on free fields. The key observation was to 
use the ghost sector to supply the missing conformal weight of 3. 

Although we have now attained our goal, we are not yet out of the woods. 
We mentioned earlier in the last section that there is a problem with respect to 
the infinite Bose sea. It turns out that if we insert this fermion vertex within a 
boson scattering matrix element, we get 0: 


Vee VE eat) (4.5.6) 


What is needed, of course, is a vertex with ghost charge +4 to cancel the —5 
coming from the fermion vertex. This new vertex V;,. must anticommute with 


166 4. Conformal Field Theory and Kac—Moody Algebras 


the BRST charge up to terms that vanish on-shell. It is easy to show that any 
vertex 


V = [Qszprst; ©] (45:7) 


for arbitrary has a vanishing anticommutation relation with the BRST charge 
because Q is nilpotent. However, all these vertices are spurious. These states 
are null and do not couple to real states |R), which satisfy the relation 


Qzrst |R) = 0 (4.5.8) 


so they cannot be used as vertex functions. They simply produce zero matrix 
elements with the physical sector of the theory. However, there is a vertex 
function for which this reasoning does not apply: 


Vij2 = 2[Qprst, € V_-1/2].- (4.539) 


Normally, we would expect that such a vertex is also spurious and does not 
couple to real states of the theory. However, & V,;, as we said earlier, is not 
part of the irreducible Fock space of the theory, and thus we cannot simply say 
that this commutator vanishes on contraction with real states. This vertex does 
not necessarily vanish because 


QOprsté |0) 4 0. (4.5.10) 


After working out the details, we find that this vertex is equal to 

Vin = ur(kyee fe aX" + Lik WW Wy op S° the? “nbSy). (4.5.11) 
This, it turns out, is the correct fermion vertex that allows us to contract onto 
Vis: 


However, at this point we realize a rather disturbing problem. We now have 
too many possible vertices! For example, we could also have written 


V3. = [Qprst, €Vi/2], 
Vsy2 = [Qarst, & V3/2], 
v= (4.5.12) 


In fact, there is an infinite number of such vertices, each one linking inequiv- 
alent Bose sea vacua. This is certainly an embarrassment of riches. However, 
it is possible to show that we need only use V; » and V , >, and that the other 
vertices do not yicld any new matrix elements. We wish to show the following 
identity: 


(+++ Viijo(ty, ki, 21), «++ Viyo(ua, ka, 22) ° ++) 
= (: ae Vi (uy, ky, a eA a 1/2(U2, k>, Z2) sie ar (4.5.13) 
The proof that we can simply switch the various and —! ghost indices on 
the vertices at will involves some rather subtle arguments that take us between 


the irreducible small Fock space (which does not include the zero mode of £) 
and the reducible large Fock space (which does include the zero mode of €). 
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We begin by rewriting the vertex V,,. in the equivalent form 


1 
Viale) = = f dw jarst(w)& (z2)V_1/2(z9). (4.5.14) 


Notice that the vertex is now written in the large Fock space, so we must 
insert another &(z) to redefine the vacua so that we have a nonvanishing matrix 
element. However, because the position of &(z) is irrelevant (because we want 
only its zero mode), we can now rewrite the matrix element as 


l 
( = O23) Vyas ei f dwijprst(wé (22) V_1/2(Z2) °- }. (a6) 


So far we have done nothing. We have simply gone from the small Fock space, 
where the zero mode of € never appears, to the larger Fock space, where it 
does. However, the value of the matrix element remains precisely the same. 

Now we make the following observation. The contour integral that encircles 
z2 can be changed at will, so let us enlarge it until it goes around the back of the 
Riemann surface (a sphere) and finally comes around and encircles the point 
z,. Of course, the enlarging of the contour cuts across other bosonic vertices, 
but notice that jgrst commute with all of them on-shell, so that we can move 
both the contour and the current jgrst at will until they encircle z,. 

At this point, notice that the contour integral is now around & V_;/2, which 
can now be written as V,)/2. Thus, we have totally reversed the position of the 
contour integral, and we can eliminate all the € and go back to the small Fock 
space. Symbolically, the steps can be represented as follows: 


Oe Vetoes Viggo) f te Ve) E ant jaasr¥-1 | cs ) 


i ( se E dwt oasr¥-17| ete V= 172) | 


> (Vie Via’). (4.5.16) 


The purpose of this exercise was to show that we can successfully reverse the 
position of the ; and —4 in the matrix element. This means that, although there 
is an infinite number of fermion vertices, they will all yield the same on-shell 
matrix element. 


4.6 Spinors and Trees 


In order to calculate tree amplitudes, it is necessary to construct an explicit 
representation of the spin fields in terms of NS—R operators. Although these 
fields were highly coupled operators in the GS formalism, the great advantage 
of the conformal field theory is the fact that the interacting spin field can 
be composed of free fields, making the calculation of correlation functions 
explicitly possible. 
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We will construct the spin field out of the generators of the SO(10) algebra, 
which in turn are composed of the y(z). The SO(10) algebra is a rank 5 
Lie algebra (see Appendix). This means that, out of the 10(10 — 1)/2 = 45 
generators of the SO(10) algebra, five mutually commute among themselves, 
forming the Cartan subalgebra. The commutation relations among these five 
commuting generators and 40 noncommuting elements are 


Wea he Hj] = 0, 
[Hi, Ea] = aE, (4.6.1) 
[Ea, Eg] = £(a, B).Eaigs 


The last identity is true if E,,, is a raising or lowering operator. Each a@ is a 
root vector of the group SO(10). The € terms are the structure constants of the 
group, which obey various symmetry conditions and associativity. 

Let us now introduce five mutually commuting fields ¢; and express the 
45 generators of SO(10) in terms of them. We can represent the five mutually 
commuting members of the SO(10) algebra as 


To represent the remaining 40 generators, write 
Eq =: #i:¢,, (4.6.3) 


where f is a matrix that takes us from the root vector @ into a linear combination 
of the ¢; vectors: 


Ba = Gal, 1, 05070), (4.6.4) 


We also include its permutations. Notice that there are four possible arrange- 
ments of the + and — signs and that there are 10 possible ways in which these 
signs can be placed into five slots. Thus, this matrix has 40 elements in it. Each 
slot corresponds to one of the five @;. 

The purpose of the factor cg in (4.6.3) is to make the commutation relations 
come out correctly. If we insert these expressions back into the definition of 
the algebra, we find that we must set 


Cals =elQrBjes <a. (4.6.5) 
If we demand associativity of the commutators, then we also have 
e(a, B)e(a + B, y) = e(a, B + y)e(B, y). (4.6.6) 
One of many possible representations of these two cocyeles is 
(ax B= (1), (4.6.7) 


where 


o(a, a) = }(a, a), (4.6.8) 
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and 
a(a, B)+0(B, a) = (a, 3) mod2 (4.6.9) 


(see [9, 10] for other conventions.) 
We can also represent the NS anticommuting vector field in terms of this 
bosonization. Let us write 


rrr = relPibice, (4.6.10) 
where 
Pa = (+, 0, 0, 0, 0) (4.6.11) 


with permutations. There are 2 x 5 elements within p, which is the number of 
elements within y“. 

Let us now represent the spin field in terms of this bosonized picture. Let us 
define the matrix 


= ee ence). (4.6.12) 
Notice that there are 2° = 32 terms in this matrix. Let us define 

Sa = 1e*4i:¢,. (4.6.13) 
This field S, has the correct number of components for a spinor in 10 dimen- 


sions. cipal it has weight 2. This is because each individual factor has 
weight ! 3» and there are five of them, for a total of ; >. This confirms the ear- 
lier observation made in (4.3.9), which was based purely on group-theoretic 
arguments, that the spin field S, has weight 2. = 
The field e~''/?” has weight 3, and e/? hs weight —2. Thus, our vertex 
with weight | is given by 
Vijpig = Drew eee (4.6.14) 


Now that we have an explicit representation of the spin field, we can calculate 
matrix elements for fermion—fermion scattering. 

Let us calculate the four-fermion scattering amplitude, represented by the 
product of three independent factors involving X, Sy, and e~('/"*: 


(Vo; (Z1) Vern (22) Vers (Z3) Vor, (Z4)) (4.6.15) 


we Set Z} = 00, 72 = 1,23 = Z, Z4 = 0. = 
Let us calculate each factor separately. Using the formula derived earlier in 
Chapter 2, we find that the X-dependent factors are equal to 


(V(ki, 21)V (ko, 22)°+» Vk, zw) = | [le — 2s), (4.6.16) 
Ie) 


where 
Vik, 2)= ef kX(2). (4.6.17) 


and the sum of all k; equals zero. 
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Let us now specialize to the case of the ghost contribution, which equals 
(e201) 9“ U/2)4(0) p26) 9-1/2) — E21 — zy 4. (4.6.18) 


Lastly, the spin field contribution is 


(Sa(0o)Sp(1)Sy(z)S5(0)) 
= [2(1 — zl — zy" )ap (Yu )as — ZY )as(Y)py }(4.6.19) 


Putting all three pieces together, we arrive at 


1 
Ag = e | dzze kh] a zee! 
0 


x {1 —2)(y")ap(Yudys — 207" )as(Yudpy}- ——-«(4.6.20) 


Rewriting this amplitude, we finally arrive at 


Ag = g° {BL — $t, —$5 (Yap (Yu)ys 
— B(—5t, 1 — $5)(y")a5(Yu)ey } - (4.6.21) 


This calculation and others involving N-point fermion scattering amplitudes 
are not that difficult with conformal field theory [9, 10}, but would have been 
extremely difficult in the older covariant NS—R and GS formalisms. 


4.7 Kac—Moody Algebras 


Although the results of conformal field theory are powerful, to the beginner 
the various rules and conventions may seem a bit arbitrary and random. It 
appears, at first glance, that conformal field theory depends on clever tricks 
and accidents rather than anything fundamental. 

In reality, the underlying consistency and elegance of conformal field theory 
are due to new infinite-dimensional Lie algebras, called Kac—Moody algebras 
[11-22], which are powerful extensions of the usual finite-dimensional Lie al- 
gebras. They were discovered by mathematicians V. G. Kac and R. V. Moody 
in 1967, although one form of them was already known to physicists in the 
mid-1960s as current algebras. Together with the superconformal group in 
two dimensions, the SO(10) Kac—Moody algebra provides the mathematical 
framework for conformal field theory. Many of the matrix elements for con- 
formal field theory, in fact, can be viewed as Clebsch—Gordan coefficients for 
Kac—Moody algebras. 

We define a Kac-Moody algebra to be a generalization of an ordinary Lie 
algebra such that its generators obey 


[Deeds Vf Te eee, (4.7.1) 


The algebras looks very much like an ordinary Lie algebra, except for the 
infinite integer index m on each generator and the constant k, which is called 
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the level. The zeroth component of the T’s is nothing but the algebra of a finite 
Lie Algebra. We will often find it convenient to rewrite the generators of the 
Kac—Moody algebra as the Fourier components of a single function defined 
over a circle: 


(Ce en | (4.7.2) 


We can also break down the generators of the Kac-Moody algebra into the 
Cartan subalgebra and its eigenvectors: 


H,(0), — E,(6). (4.7.3) 


Thus, a Kac—Moody algebra looks like an ordinary Lie algebra smeared over 
a circle. 

Let us now construct what is called the “basic representation” of the Kac— 
Moody algebra using vertex operators. This representation holds only for 
simply laced groups (i.e., groups with roots of equal length, which are A, 
D, and E Lie groups) with level one (k = 1). Let us first define the string 
variable 


i ,4—iné 


$:(0) = 4: + pb +i > > —. (4.7.4) 
nxo)6|OCU 
Now introduce the basis vectors for the lattice of our Lie algebra such that 
ej -e; = Orn (4.7.5) 


This allows us to write the string variable and the vertex function as vectors 
on the lattice: 


o6) =) edi, 
qiOjiane re, (4.7.6) 
where the c; are the familiar cocycles introduced in (4.6.5) in order to get 


the commutation relations with the correct sign. There are many possible 
representations, one of which is 


Ci = x | in (x-5) ey | : (4.7.7) 


k<i k>i 
Finally, this allows us to introduce the generators of the Kac—moody algebra: 


ni dd; 
H,(0) = :4; (0)qi(@): = SST 


aa aaa He cee). (4.7.8) 
where a = e,; — e; and we use the + sign fork > j and the — sign fork < j. 
Because the ith and jth elements commute, the H’s are generalizations of the 
Cartan subalgebra, i.e., the set of mutually commuting elements of the Lie 
algebra. 
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Let us now calculate the commutator between these generators. In much the 
same way as before, we find that the product of two vertex functions is 


Viase= <2, 
V(x, O)V(B, 6’) = AO — 6')~*F eft POFB HO), (4.7.9) 


where 


A(G=6 =e 1 (ieee eee, (477 


i 
= 6 +12 
We now have all the identities necessary to compute the commutator between 
all the generators. We find, by direct computation, 


[A @) AAG) 90, 
[H;(0), E,(6’)] = —2776(6 — 6’)ax; Ey (0). (4.7.11) 


Notice that the H’s are mutually commuting, as they are in the ordinary Lie 
algebra, and that the a; are eigenvalues of the H’s. The remaining commutators 
between the various E’s are given by 


[Ea(0), E-o(9')] = 2780 — 8’) 9 a H(9) + 27158 - 8") (4.7.12) 


and 
276(8 —O)Eaia(0) ifa+ Be), 


4.7.13 
0 otherwise ( ) 


[E.(9), Ea(6") = | 
(T is the root lattice.) Thus, we see that the commutators of a Kac-Moody 
algebra are very similar to the ordinary ones for a Lie algebra, except that the 
generators are smeared over a circle. 
It is also crucial to notice that the semidirect product between the Virasoro 
algebra and the Kac—Moody algebra is possible. The commutators are 


ee ae 


m+n? 


(Es, be lee on ee ssn’ aie (4.7.14) 


Depending on the representation of the algebra that we choose, we can get a 
correlation between the level & of the Kac—Moody algebra and the central term 
c of the Virasoro algebra. One remarkable property of the Kac—Moody algebra 
is that we can construct a representation of the Virasoro algebra entirely in terms 
of the Kac—Moody algebra. Let us write the Virasoro generators smeared over 
a circle; 


L(@)=1N bs :H, (0): + ye FOE 00) 
d 
iN (> 10: : (4.7.15) 


i=) 


I 


This is called the Sugawara form. 


~ 
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We have all the necessary identities to commute these past each other, and 
we find 


[L(6), L(6)] = 27186 — 6’)(L() + L6’)), 


é 
- el Cl —6')+8(0—-6’)), (4.7.16) 
where 
nem 
1+ 362(G) 
= ¢ (4.7.17 
1+ 40(G)’ 


where d is the dimensionality of the group and c(G) is the value of the 
quadratic Casimir operator for the adjoint representation. 

The last commutator yields the semidirect product between the Kac-Moody 
and Virasoro algebras: 


[L(0), E(6)| = 86 —0’)E.(0). (4.7.18) 


We compute the value of the central charge for a few groups: 


c Group 
n—1 SU(0) 
n+ ; SO(2n + 1) 
n SO(2n) 
n E; 
n(2n + 1) 

: air Sp(7) 


From the perspective of Kac—Moody algebras, let us now reanalyze confor- 
mal field theory, reinterpreting the results of conformal field theory from group 
theory. We see that the spin fields given in (4.6.13), under the SO(10) Kac— 
Moody Lie algebra, transform as a 32-component spinor. Likewise, the NS 
anticommuting vector fields given in (4.6.10), under the SO(10) Kac-Moody 
algebra, transform as 10-component vectors. We can also view the b, c, B, y 
ghosts of conformal field theory as forming representations of the superconfor- 
mal group with different values of the central term given in (4.4.43). Finally, 
we can also view the correlation functions, which are the heart of conformal 
field theory, as the Clebsch—Gordon coefficients found in the different tensor 
products of various representations of the SO(10) Kac—Moody algebra coupled 
with the superconformal group. 
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4.8 Supersymmetry 


Finally, we can now construct the supersymmetry operator in this theory that 
allows us to go back and forth between the fermion and the boson sectors of 
the theory. Because V4.)/2 has conformal weight |, this means that its integral 
is an invariant under the conformal group. Thus, if we take k = 0, we find the 


supersymmetry operator 
1 
=.) ae 4.8.1 
Q41/2,0 =f zVei/2. ( ) 


Depending on whether we take the + or — fermion vertex, we find 
Oa Sce0 
Di jae = eS? (yee. (4.8.2) 


These operators indeed have the property of converting the fermion sector into 
a boson sector, and vice versa: 


[Qa, Ve(é, kz], = Vall =u GQ omak, 2s 
[Ou, Va(l,k. z)] = Vea? =k" oy), kee). (4.8.3) 


Once again, however, we are faced with the problem of having an infinite num- 
ber of such supersymmetry operators, which link the various inequivalent Bose 
sea states together. However, because of arguments given earlier, we do not 
have to worry about this. Although there is an infinite number of inequivalent 
Bose vacua, the matrix elements between them yield the same answer. 


4.9 Summary 


The great advantage of the superconformal field theory is that we have the 
best features of the NS-R and GS models. The superconformal field theory 
has the advantage that it is manifestly covariant, based entirely on free fields, 
possesses a covariant supersymmetric generator, has spin fields of weight 2 
and fermion vertices of weight 1, and is easy to manipulate. 

There are two drawbacks, however, to the conformal field theory. First, we 
must carefully keep track of the many bosonized ghosts. Fortunately, these 
ghosts are free fields and operate on different Hilbert spaces. Second, we have 
an infinite number of “pictures.” Fortunately, the final S-matrix is independent 
of whatever choice of “pictures” we take. 

The essence of conformal field theory is that we can calculate the correlation 
functions of various fields simply by knowing their transformation properties 
and their short-distance behavior. For example. the correlation function 


(elk X(w) gfkX(2)) —k? 


eo) 


can be calculated once we know the short-distance behavior of two string fields. 
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One of the achievements of the conformal field theory is the construction 
of a fermion vertex function with conformal weight 1. This vertex function is 
dependent on a spin field S,, which must transform as a genuine spinor under 
the SO(10) Lorentz group: 


_ ima ; 
j*’(2)Sq(w) ~ ry gales Oy Sw) eee (4.9.1) 


Other considerations, such as the field having dimension 3, allows us to fix the 
relations 


1 
as) ———— ye wae 9. 
Ww" (Z)Sa(w) Jie — wee Jo Sp(Z) + (4.9.2) 
LY enous | Sng SY Py, 
eo) (wr a + Vz - pn Mav) 


1 
ee 2) Fizyw'’(z) tes. (4.9.3 
Fier eV Ov'@) (4.9.3) 
Once we know all the short-distance behavior of the spin field, we can construct 
the vertex function that has weight 1. We need an extra piece that has weight 
:, which comes from the ghost sector: 


as 
Saas f az dd dOBDC +c.., - (4.9.4) 
where 
BZ) = Ob(z), 
| (z) = B(z) + 6b(z) (4.9.5) 
Cie — c(z) + ev iz): 
The ghost sector can be bosonized according to 
, Bp=e %d, 
= e?n. (4.9.6) 


This, in turn, finally allows us to write the vertex function with conformal 
weight | on-shell: 


Vai = wren Sate (4.9.7) 
which has conformal wei ght 
+23 4°’. (4.9.8) 


In order to write amplitudes that can contract with the —} vertex operator, 
we need a corresponding vertex function with +5, which is given by 


Vip = ut(e* [eax + Lik WW yp )apS? + 46°?” nbSy).(4.9.9) 


With these two vertices, we can now construct multifermion scattering 
amplitudes. 
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One drawback of this formalism, however, is that we have an infinite number 
of vacua and hence an infinite number of vertex functions. These vacua can be 
constructed as 


e797 10) = Iq), (4.9.10) 
weight: seq(q + Q). (4.9.11) 


This, in turn, allows us to construct an infinite number of fermion vertex 
functions. For example, 


V3;2 = [Qarst, €Vi/2], 
Vso = [Qsrst, & V3/2], (4.9.12) 
Ve 


are all acceptable vertex functions with weight |. This is an embarrassment 
of riches. However, it can be shown that, at least on-shell, all these vertices 
are equivalent to each other. Thus, we can take any one of these at will and 
construct the fermion—fermion scattering amplitudes. 

One of the great advantages of this formalism is that we can now construct 
the supersymmetry generator, which is nothing but the vertex function inte- 
grated over z. In fact, we actually now have an infinite number of inequivalent 
supersymmetry operators, all of them being equivalent on-shell: 


—(1;2) 
Q-1/2.0 = sae P 


Qi 2.0 = e¥ SF(y")gg0-X ,., etc. (4.9.13) 


Finally, it is possible to obtain a coherent overall picture of conformal field 
theory if we view it as one way of calculating matrix elements for infinite- 
dimensional Kac—Moody algebras and the superconformal algebra. A Kac 
Moody algebra has commutation relations 

(Tis be) = ee en, 
where the f’s are the structure constants of a finite-dimensional Lie algebra. 
For level-one simply laced algebras, we can construct the basic representation 
of the Kac-Moody algebra. The basic representation is given in terms of vertex 
operators first found in string theory. 

In this formalism, the spin fields $, and the NS-R y fields can be viewed 
as spinorial and vector representations of an SO(10) Kac—Moody algebra, 
and the various (b,c) and (B, y) ghosts can be viewed as forming different 
representations of the superconformal algebra. Correlation functions can then 
be viewed as Clebsch—Gordon coefficients for the combined SO(10) Kac— 
Moody and superconformal algebra. 
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CHAPTER 5 


Multiloops and 
Teichmuller Spaces 


5.1 Unitarity 


One of the most exciting aspects of string theory is the possibility of a theory 
of gravity that is totally finite and therefore independent of conventional renor- 
malization theory. String theory may provide a framework in which, for the 
first time, a finite theory of quantum gravity may emerge. What 1s particu- 
larly fascinating is the mechanism by which the cancellation of all potential 
divergences takes place, namely the use of topological arguments to eliminate 
certain divergences. Once again, we see the enormous power of symmetry that 
is built into the string model. We will show, for example, that the diagrams 
that are potentially divergent are topologically equivalent to the emission of an 
effective dilaton. Therefore, by eliminating the dilaton from the theory, we ob- 
tain a theory without any apparent divergences. Thus, mechanisms that never 
before appeared in point particle quantum field theory are responsible for the 
elimination of potentially harmful graphs. 

So far, we have developed only the first quantized theory of interacting 
strings without loops. This, of course, cannot yield a unitary theory. The Eu- 
ler Beta function, we saw earlier, has poles on the real s-plane axis, without 
imaginary parts or cuts, and therefore the theory describes only tree diagrams. 
Early attempts were made to modify the original Beta function by adding an 
imaginary part to the mass of the resonances: 


ay 
i i—_ )° — (5.1.1) 
DF == M3 + iT, 


but then the marvelous properties of the Beta function were inevitably 
destroyed. 
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The correct proposal to unitarize the model was finally made by Kikkawa, 
Sakita, and Virasoro (KSV) [1] in 1969 by adding loops, treating the Beta 
function as the Born term in a perturbative approach to defining the S-matrix. 
The actual multiloop amplitudes were calculated by Kaku, Yu, Lovelace, and 
Alessandrini [2-8]. (We should point out, however, that the KSV unitarization 
program breaks down when we discuss M-theory. For membranes, it is not clear 
how to construct even the free theory, let alone the theory of interacting mem- 
branes. D-branes have proved useful in understanding the excitations found in 
M-theory, but a complete understanding of the interactions in M-theory is still 
lacking.) 

To understand how the perturbation series is set up, let us beygin with the 


time evolution operator U, which transforms an initial state at t = —oo into 
the final state at t = oo. The S-matrix is the matrix element of U: 
Dye (Co co)i|y es (Sale) 


Because the time evolution operator is unitary, the S-matrix itself is also 
unitary: 


StS =SSt=1. (5.1.3) 
In matrix form, this reads 


SiS |n) (n| St) = 8, (5.1.4) 
n . 
where the n’s are a complete set of intermediate states. If we separate out the 
state that corresponds to no scattering, we get the JT matrix: 


S=1-iT. (a5) 
Then 
- 1G aed A oy BY aa (5.1.6) 


If we take matrix elements for the scattering of the multiparticle initial state 
(i| going to the multiparticle final state | j), then we have 


Im = —5 DIT nl) (5.1.7) 


(see Fig. 5.1). If we represent the four-string scattering amplitude as (/| T |k). 
then clearly we must combine various four-point functions to obtain the next 
order in the perturbation series. Because strings can “twist” as they sweep out 
a two-dimensional surface, the set of Feynman diagrams for the loops is larger 
than for simple planar diagrams. In fact, as Fig. 5.2 shows, there are three types 
of diagrams that we can construct using the optical theorem. 

For the open string, the interactions sweep out a world surface that is topo- 
logically equivalent to a disk with holes. In addition, we can also have “twists” 
in the disk. (A twist is created when we cut a line between two holes and then 
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Be: 


FIGURE 5.1. Unitarity of the S-matrix. The imaginary part of the scattering ampli- 
tude is proportional to the square of its absolute value. In this fashion, we can construct 
higher loops from lower-order tree diagrams. This was the original unitarization 
scheme advocated by Kikkawa, Sakita, and Virasoro. 


EX x 
rant 
rt 


FIGURE 5.2. Planar, nonplanar, and nonorientable single-loop open string diagrams. 
Unitarity forces us to sew together three types of graphs. (The x on the string 
corresponds to twisting the leg.) The nonorientable diagram corresponds to a Mobius 
strip. The nonplanar diagram corresponds to placing some external tachyon lines on 
the boundary of the interior loop. 


" 


ii) 
i] 


rejoin the cut by reversing the orientation of the points along the cut.) The three 
types of open string diagrams are: 


(1) Planar diagrams, which are topologically equivalent to a disk with N holes 
punched in the interior with the external lines located on the exterior edge. 

(2) Nonplanar orientable diagrams, where the external lines can be located on 
some of the internal holes as well as the external edge, or where the holes 
overlap. 
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(3) Nonorientable diagrams, where we have an odd number of twists in the 
surface of the disk. The Mobius strip is an example of a nonorientable 
diagram. 


For the closed string, the tree diagram swept out by the interacting string 
is topologically equivalent to a sphere. There are two types of loop diagrams 
we can create out of the closed string. Let us cut 2N holes in the sphere and 
carefully mark the orientation of points along the circular edge of each hole. 
Then rejoin N pairs of holes to obtain a sphere with N handles. The two types 
of diagrams are: 


(1) Planar diagrams, where the orientation of the circular edge of each pair of 
holes is preserved when we resew the diagram. A doughnut, for example, 
is a planar single-loop diagram. 

(2) Nonorientable diagrams, where the pairs of points along the circular edges 
of the holes are joined by reversing the orientation of the holes. A Klein 
bottle, for example, is a nonorientable diagram (in Fig. 5.3, we see how 
Klein bottles can be assembled from flat two-dimensional surfaces that 
have their boundaries identified). (We note that only Type I strings, which 
carry no orientation or direction, have Klein bottles in their perturbation 
series. Type II strings carry an intrinsic orientation and cannot produce 
Klein bottles.) 


In the functional formalism, we saw earlier that all tree diagrams were con- 
structed by calculating the Newmann function for the disk or the sphere [9-1 1]. 
The simplest method of calculating this Newmann function was to conformally 
map the disk or sphere to the upper half-plane or the entire complex plane. We 
then borrowed a method developed in electrostatics, the method of images, to 
write the Neumann function: 


G(z, z) = In|z—z'| + In|z — 2’. (5.1.8) 


Now, we will generalize this discussion to Riemann surfaces with holes. For- 
tunately, mathematicians long ago wrote down the Neumann function for the 
disk and sphere with N holes. In fact, Burnside [12] solved this problem in 
1891! The solutions to this classical problem are given in terms of automorphic 
functions, which we will now analyze. 


5.2 Single-Loop Amplitude 


First, let us set up the functional integral for the single-loop diagram: 


N 
Ae | DK dive? pee (5.2.1) 
Ss: ° 


i 


where we functionally integrate over a horizontal strip in the complex plane 
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FIGURE S.3. The Klein bottle. By identifying the opposite sides of a rectangle. we 
can obtain either a torus or (if the orientation of the sides is reversed) a Klein bottle. 
shown here. The Klein bottle is a two-dimensional closed surtace with only one side. 


(see Fig. 5.4) that has finite length and then identify the left and right edges. In 
this way, we construct a surface topologically equivalent to a disk with a hole. 
The functional integral, of course, can be calculated explicitly, leaving us with 
the factor 


exp De kiN (zi, 2))k i| (5.2.2) 


where N is the Neumann function. 

Now, let us topologically deform the horizontal strip into the following 
surface. Consider an annulus, defined as the region in the upper half-plane 
that has an outer radius of r;, and an inner radius of r, and a ratio w = rp/ry. 
Now impose the fact that the outer perimeter is to be identified with the inner 
perimeter. This means that a point z on the surface of the outer perimeter is to 
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3 


FIGURE 5.4. Conformal surface of the single-loop open string diagram. In the o- 
plane, the surface corresponds to a rectangle of width z and of arbitrary length, 
such that we identify the ends. Rectangles of constant width with varying lengths 
are all conformally inequivalent, so we must integrate over all lengths in the path 
integral. The wavy lines correspond to “zero width” strings or tachyons, which may 
attach to the surface on either the upper or lower boundary. In the z-plane, the 
surface corresponds to a narrow tube that is bent into a half-circle, with external 
lines emanating from the ends. 


be identified with the point on the inner perimeter with the same polar angle: 
Z—> wz. (52253) 


This identification creates a semicircular tube in the upper half-plane. The 
conformal mapping from the horizontal strip to this tube is just the exponential. 

A tube, in turn, can be mapped into a disk with a hole by stretching one 
end of the tube until it becomes a large circle and then shrinking the other 
end. When we construct Neumann functions in this annulus, we obviously are 
interested in functions that have the property 


w(z) = w(wz). (5.2.4) 


If we take arbitrary powers of w, this identification actually divides the upper 
half-plane into an infinite number of concentric circles. Each concentric circle 
has radius w”. By identifying the outer perimeter of one annulus with its inner 
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perimeter, we can create an infinite succession of tubes. Thus, the entire upper 
half-plane can be decomposed into an infinite sequence of these tubes, but we 
are interested in only one of them. 

Thus, we have divided up the upper half-plane by making an identification of 
concentric circles generated by multiplication by the number w. This number w 
thus parametrized the disk with a hole. This is called the Zeichmiiller parameter 
for the single-loop diagram. These parameters have a natural generalization to 
surfaces with N holes. 

In general, we can also divide up the upper half-plane by using an arbitrary 
projective transformation, which, as we saw earlier, maps the real axis into 
the real axis. (In general, circles are mapped into circles under a projective 
transformation.) We define an automorphic function as one that has the property 


v(z) = vz’), (5.2.5) 
where we make a projective or SL(2, R) transformation: 
maar o 
SL(2,R): 4° cz+d’ (5.2.6) 
ad —bc=1, 


for real a, b, c, and d. (Phases can sometimes enter into the periodicity prop- 
erties of these functions.) Fortunately, mathematicians have calculated this 
function. The Neumann function is 


N(z, 2’) = In| wW(z'/z, w)| + In |W(Z'/z, w)I, Cee) 
where we demand, up to a phase, that the function be periodic: 
w(z, w) = W(wz, w). (5.2.8) 
Explicitly, this periodic function is 
In w(x, w) = In(1 — x) — 3 Inx + ne : 
s 2 In w 


55 
+) [In — w"x) + In(l — w"/x) — 21n(1 — w")] (5.2.9) 
n=] 
Notice that this function explicitly has the required periodicity property 
mentioned earlier. Thus, when we exponentiate the Neumann function, the 
momentum factor in the integrand contains factors like 


N 
[]][]q@ - wep. (5.2.10) 


n i<j 
This function can also be rewritten in terms of Jacobi theta functions: 


©,(& | t) 


Ais => —2 ] J Y 
W(x, w) mi exp(ig Olay’ (5.2.11) 
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where 


Inx 


2ni’ 


iw 25 
=e (5.2512) 


There are four Jacobi theta functions, given by 


O\(v|t) =i 3s (—1) tg? gin en—bw 


n=—-CO 


oo 
= 2f(q*)q'“ sina | [a — 2q?" cos2nv + q*"), 


pial 


O2(v | tT) = O,(v + 4 | 2), (5.2.13) 
@3(v | t) = Da g” enn 


lo. ¢) 
= f(q)| [G+ 297"! cos2xv + q*"), 


=| 


Og(v | tT) = O3(v + 3 | tr) = ie HMO (vy + 47] 7), 


where 
fae (la) ge", (5.2.14) 


is the partition function. 
The amplitude is thus 


i Aloop = [mf @* ra T]T]o _ eee! 


eb <al 


Using conformal invariance, we can also determine the factors A and dy. 
However, we will find it convenient simply to summarize the results of the 
harmonic oscillator approach, which also arrives at the correct integration 
measure. 


5.3. Harmonic Oscillators 


The most convenient language in which to discuss the planar, nonplanar, and 
honorientable single-loop diagrams is the operator language, which, of course, 
is a specific representation of the functional integral. Let us begin with a strip 
in the complex plane t + io, the sets of all points from o = 0 to 7. Now place 
the interaction with external lines along the x-axis. As before, if we take this 
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to be our surface for the functional integral, then we can insert a complete set 
of intermediate states everywhere: 


Ix) f DX, (X= 1, (5.3.1) 
and we obtain for the functional integral 


(5.3.2) 


sss f DX, (XL VolXa) f DX2 (al -—> 
is 

As before, we note that the above expression for the functional integral is 
possible because the Hamiltonian is diagonal in the harmonic oscillator basis. 
By eliminating the functional integration at all intermediate points along the 
strip, we obtain the first planar loop amplitude, in operator language: 


Ap = [ 4?p TVD VD: -- VoD). (5.3:3) 


Carefully choosing the gauge for the multiloop calculation is crucial. For the 
tree amplitudes, we saw in (2.9.5) that ghost states do not couple to trees, so 
they can essentially be ignored. However, ghost states do couple to factorized 
trees because of (2.9.9), so they can propagate internally in a loop unless they 
are carefully eliminated. An identical problem occurs when quantizing Yang- 
Mills theory (which is not surprising, since both Yang Mills and strings are 
gauge theories.) 

There are three standard ways of eliminating the ghosts in loop amplitudes. 
First, we can insert projection operators which explicitly remove the ghosts 
from the Hilbert space. However, this technique is rather cumbersome and pro- 
hibitively difficult for fermion loops. Second, we can allow Faddeev—Popov 
ghost states to propagate and cancel the ghost states. Using BRST and confor- 
mal field theory, even higher fermion loop amplitudes can be calculated. And 
third, we can use the light cone gauge, which we will use in this chapter. 

The advantage of choosing the light cone condition is that it vastly simplifies 
the calculation. Ghost states, for the most part, can simply be ignored. 

In addition to the light cone gauge, we will also choose a specific kinematic 
frame for the external legs of a multiparticle amplitude: 


Ft); (5.3.4) 


(We should caution the reader that certain complications exist with respect 
to the kinematics of enforcing this for an arbitrary number of external particles 
with arbitrary spin. For example, we may have to perform analytic continua- 
tions of the external components of momenta in order to preserve this frame. 
A careful analysis shows that we can always choose this frame for fewer than 
26 external boson legs and less than 10 supersymmetric legs. There are no 
inherent problems, however, with the light cone approach because we can al- 
ways perform a Lorentz rotation on the vertex functions to a frame where the 
+ components are nonvanishing. The light cone gauge is compatible with ar- 
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bitrary values of the + components of the momenta. As a result, we will omit 
further discussion of this delicate point in our discussion.) 

Fortunately, this trace is easily evaluated using the coherent state formalism 
in (2.6.18) that we used for the tree diagram: 


N 
An =| [ax [ap Tr[Vo(k1, x1) Vo(k2, X42) +++ Vo(ky, X1X0-°+xXy)wh? 7], 


i—) 
(5.3.5) 
where 


Wi == 42+ * XN. (5.3.6) 


The trace can be explicitly calculated using coherent state methods. We use 
the identity 


Tr(M) = -|/ d*re7*P (A) Ma). (5.3.7) 
Carefully working out the different factors, we have 
Ay = fare [[axat’*7, (5.3.8) 
i=] 
je f(wy- D I] Il [saan i (5.3.9) 
ae wey _ 


where 
Pi =X 42° * "Xi; 
fw) =[][a-w”, (5.3.10) 


p= 


Cji = Pj/Pi- 


We can also explicitly perform the p integration: 


D Des up oe 
D (1/2)p? 2m =1/2 ji 
fa le Sea le G o0| Fi 


Combining everything together, we have [13, 14] 


1 N-1 i eee ae . 
Aw anf T] eerimmrdn f “+ ( = ) fGen | |\wan”. 
OR 0 . 


In w a 
(53.12) 


kik, 


(S2o5 11) 


Wei = W(cj, wv), 
vy, =Inp;/Inw, (5,3-13) 
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q= eer /inw 
and the 6 function here simply orders the various v; factors along the real axis. 
This is the final result for the planar single-loop amplitude. Notice several 

features about the result: 

(1) As predicted by the path integral method, the integrand is an automorphic 
function. 

(2) The measure is easily evaluated in the harmonic oscillator approach. (The 
calculation is a bit harder in the path integral approach.) 

(3) The integral diverges at g = 0, which corresponds to the inner hole shrink- 
ing to zero radius. The divergence is a mild one and can be eliminated once 
we add superpartners to the strings. 


For future reference, we will define the planar (P), nonplanar (NP), and 
nonorientable (NO) integrands at the same time: 


_l-x In? x \ © [(1 — w"x)1 — w"/x) ; 
voir) = AZ exp (5%) fy] [ See, (5.3.14) 


Oe ic: ln (1 — w"x)(1 + w"/x) . 
Wnp(x, w) = WV: exp (z a lS we . (Set) 


a In; x (lL — (—w)"x)(1 — (—u)"/x) 
Wno(x, w) = Vr (2) FI d= (=e yy? 2 
(3.3.16) 


These functions, in turn, can be reexpressed in a form in which their link with 
Jacobi theta functions is more ci 


—2 ] = 2q? Z ) 4n 
we(x, w) = mend sinzv| | ( saa tS ie ) 3.47) 
Ing ( =e SO ar 
—4 1 —2(- & ) . 
Urol, wy = = satonf | (va) ae am! Hh (5.3.18) 
In : CU = yr 
[=o 2n-1 4n—2 
Wnp(x, Ww) = a lie q) = | .  (S58.89)) 
Written explicitly in terms of theta functions, we have 
li, 
Wp(x, w) = —27i exp ( oe ) 
2Inw 
Inv | Inw Inw 
Qe, OQ)! a 
x (> | ai ) | (0 Oni i (5.3.20) 


In? 
Wre(x, w) = 27 exp ls e ) 


nw 
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Inx | nw ln 
On = ho! . 
° (= 2ni ) ei (0 — i (5.3.21) 
. In? x 
Wno(x, w) = —27i exp 
2Inw 
Inx |Inw = 1 Inw 1 
© lage eg fel i 
es (= aia. ae (0 oe *): (5.3.22) 


Now that we have calculated the planar single-loop graph, let us calculate 
the nonorientable single-loop function. The trace we want to evaluate is 


Ano = / d” pTr(QVoDVo--- VoD), (5.3.23) 


where {2 is the twist operator in (2.7.13). Notice that the twist can be placed 
anywhere along the chain, and nothing changes. The trace can be evaluated 
using the same coherent state techniques, and the major change is that w turns 
into —w. The final result is 


call Tene ie (a8 =) [ [@ noi 


i ey 


2.N 1 
d 
=| []ons - wav, f a 
Oe 0 


x (- Ing)” f(-q’?)™* | [noi - (5.3.24) 
i<j 
Notice that the integration region is twice the usual one from 0 to 1. This is 
because, for the external lines to go completely around the Mobius strip, the 
lines must go around the edge twice (see Fig. 5.5). This fact will eventually 
play a crucial role in the cancellation of anomalies in Chapter 8. 


B A B A 
—_— 
A B A B 
B B A 
—_—_—_ 
A A B 


FIGURE 5.5. Integrating around a Mobius strip. For the Mobuus strip, an external 
leg must travel twice the length of the strip in order to make a complete path around 
the boundary. Thus, the nonorientable strip has an integration region that has twice 
the range of the planar one. 


— > 
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The nonplanar diagram can also be evaluated. Notice that placing an even 
number of twists on the loop separates the external lines into two classes, those 
that revolve around the outer edge and those that revolve around the inner. The 
final answer depends on which lines are on the inner and outer edges. Let us 
place the twists so that the first to the Lth lines are split apart from the others: 


Anp = fee THS2DV) D.-- > VEO Vy > I). (5.3.25) 


The integral can again be performed, with the answer [15] 


TW ‘dq (-2n?\" 21-24 kik; 
Anp =|) [aw | al ) [f(q")] | [vei or wpa. 
R j=l Ome Ing i<j 
(5.3.26) 
where we use Wnp if the 7j lines are on opposite sides of the disk and Wp if 
they lie on the same side. The integration region for the lines reflects the fact 
that there are two disjoint regions of the disk. The external lines are integrated 
sequentially, except that there are now two disjoint regions. 
For convenience, let us now put all three amplitudes in the same general 
expressions: 


N-1 1 2\N 
dqi (= ) a: ame 
Aly dv; —= || ———— 5) hae 53227) 
J [ | | | iia, f5(q) | [ai ( 


1 i=1 q5 ej 
where 
J =P, NO; NP, 
Rp = {0 < vy) <--+ < vy = lj, 
Ryo = {0 < vy) <--- < vy = 2} (5.3.28) 


fo(q’) = fne(q’), 
fro(q’) = f(-4°), 
q = 4 = Qnp = Guo: 
and where the nonplanar region of integration reflects the fact that there are 
two disjoint regions of integration. 
Now that we have written explicit representations for the various open 


string amplitudes, let us draw some rather remarkable conclusions from these 
amplitudes. 


(1) Closed Strings from Open Strings 


One of the strange features of the nonplanar single-loop diagram is that it has 
more poles than those found by factorizing on the usual open string channels 
[16]. By examining the factor 


| 
i Jaq (5.3.29) 
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yor 


FIGURE 5.6. Emergence of the closed string theory from open strings. A curious 
feature of the open string theory is that, at the first-loop level, it already contains 
the closed string theory as a “bound state.” The nonplanar diagram for open strings 
can be stretched until it becomes a cylinder, which in turn can be factorized into two 
smaller cylinders. Thus, the intermediate state must be a closed string. 


(where the is factor comes from the momentum-dependent integrand) we find 
that there are extra poles at s/4 = —2, 0, 2, 4, 6, ..., which are precisely the 
locations of the poles of the closed string sector. Thus, the open string sector 
automatically contains the closed string sector. This can most easily be shown 
with dual diagrams by thinking of the nonplanar diagram being a cylinder, with 
two sets of external lines that can rotate around the top and bottom edges of 
the cylinder. However, by factorization, we can slice the cylinder horizontally, 
such that the intermediate state is a closed loop. Thus, the closed string emerges 
as a “bound state” of the open string sector (see Fig. 5.6). The closed string 
sector, by itself, is an entirely unitary theory. However, the open string sector, 
by itself, is not. We see that the presence of open strings demands the existence 
of closed strings as intermediate states, or else the theory is not unitary. In fact, 
as noticed by Lovelace [17], this unwanted singularity in the complex plane 
for the nonplanar diagram is actually a cut (which would be disastrous), but 
becomes a pole only in 26 dimensions. In fact, this was the first indication that 
the string model was consistent only in 26 dimensions. 


(2) Slope Renormalization 


Notice that the divergence of the planar diagram arises from 


| i 
i an ga (5.3.30) 
; 0 0 q 
This divergence at g — 0 corresponds to the hole in the disk shrinking to 
zero. This divergence arises from the fact that we are summing over an infinite 
number of intermediate states propagating in the interior of the loop. 
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This is not, however, the ultraviolet divergence that we usually associate 
with Feynman diagrams. For point particles, the divergences of the various 
amplitudes arise when we deform the local topology of a particular diagram, 
such that a propagator shrinks to a point. Thus, divergences of Feynman graphs 
are associated with deformations of the local topology of the graphs. 

In string theory, however, because of conformal invariance, we cannot pinch 
a propagator to a point. Thus, conformal invariance on the world sheet rules 
out ultraviolet divergences. However, we still have the infrared divergence of 
the interior points shrinking to zero. 

Again, conformal invariance tells us that we can always map the shrink- 
ing hold into a dilaton or tachyon vanishing into the vacuum. We can always 
‘‘pinch” this shrinking hole and extract a closed string resonance with vacuum 
quantum numbers vanishing into the vacuum. Thus, conformal invariance gives 
us an entirely new interpretation of the divergences of string theory. This new 
interpretation associates with each divergence a closed string state “pinching” 
off from the hole, with zero momentum, corresponding to tachyons for the q ~* 
divergence or dilatons for the g~! divergence vanishing into the vacuum (see 
Pigs. ds 

When we pinch the shrinking hole and extract a closed string state with 
vanishing momentum, we notice that the remaining diagram looks just like a 
tree without the hole. Thus, by extracting out the divergent pole contribution. 
what we have left is a finite tree. This, in turn, allows us to treat the divergences 
as a redefinition of the free parameter, the Regge slope a’. This process, called 
“slope renormalization,” works fine for the dilaton g~' pole, but it is not clear 
how to handle the tachyon pole contribution g~*. Thus, the bosonic theory 
might not be totally free of divergences. 


(3) Finite Superstrings 


Experience with supersymmetric loop calculations for point particle theories 
has shown that the internal bosonic line cancels against the internal fermion 
line, yielding amplitudes that are much less divergent than expected. The same 
thing occurs for superstrings. Let us now turn to the superstring graphs, where 
we will be able to perform this “slope renormalization” { 18. 19] explicitly. We 
will find that the q~* divergence in the open string Type I theory cancels by 
itsclf (this 1s also expected because the theory has no tachyons). This leaves 
only the g~! pole, so slope renormalization is possible. The truly remarkable 
thing, however, is that the Type II theory is actually finite by itself, without any 
slope renormalization! 


5.4 Single-Loop Superstring Amplitudes 


The single-loop open superstring can be calculated in cither the GS or the 
NS-R formalism. In the GS formalism, we might use the light cone formula- 
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FIGURE 5.7. Emission of the dilaton. By conformal invariance, we can deform the 
single-loop open string diagram and pinch the interior loop. The resulting diagram 
corresponds to the emission of a tachyon or dilaton into the vacuum, which results 
in an infrared divergence. Superstring theory is constructed so that these poles do 
not exist, so the theory is formally finite. 


tion, because a satisfactory covariant one does not exist. The advantage of this 
formalism, however, is that the graphs are manifestly spacetime supersym- 
metric. In the NS—R formalism, we must either use a projection operator to 
extract out the ghosts or use the BRST techniques that allow the ghost to prop- 
agate and cancel the negative metric ghosts. Unfortunately, supersymmetry is 
not manifest until we add bosonic and fermionic loop contributions separately 
and insert the GSO projection operator into each loop. 

Let us use GS formalism developed in Section 3.9, where supersymmetry 
is manifest. As before, the massless vector boson vertex is 


Va(C, k, v) = ¢' B'(z):e**:, (5.4.1) 
_where 

B (gy (0) FR (z), 

R (rz) = 4S(z)y* Sx), (5.4.2) 


co 


Gi Se 
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where we have also set ¢* = 0 for the polarization vector of a massless boson. 
The fermion vertex is 


Ve(u, k, T) = aF(x)u:e’**:, (5.4.3) 
where 
F(t) = i(pty'(S(t)y « P(t) — $:R"(t)k'S(t)y/3)*. (5.4.4) 


Let us consider only the trace over external bosons. There are a great many 
simplifications. For example, the trace over the Sp operators requires at least 
eight of these operators. Thus, amplitudes with two and three external legs 
vanish all by themselves. As a result, there are no self-energy and vertex 
corrections at all in the theory. 

The first nonzero amplitude occurs for four external legs. Even then, the 
amplitude actually vanishes, except for the contribution from the term 


Ch Ren. (5.4.5) 
In fact, the only diverging contribution to the trace is 
Tr(wX/2ms-n¥° Sr) = F(w)?, (5.4.6) 


As expected, this cancels against the other contribution coming from the boson 
loop. Thus, the final single-loop open superstring amplitude is 


eS 1 
d 
A\oop = Kf [ [@@u ae wdvy | = | lea. (5.4.7) 
Or 0 


fica 


where K is the same kinematic factor (3.9.11) found in the tree graph with 
external massless boson legs. As expected, the graph diverges only as q7' 
because there are no tachyons in the theory to give us a g 7 divergence. 

Let us now extract out the infinite piece of this graph. By carefully extracting 
the finite piece near gq = 0, we find that the y function reduces to an ordinary 
sine function, so that the finite piece A becomes 


13 
Aloop = x | | [eer — v7) | [ [sin m(vy — ver dv;. (5.4.8) 
0 7=1 


1 Bree | 


To show that we can pertorm slope renormalization on this graph, let us actually 
perform the v2 and v; integrations. If we define 
sin 1(v2 — Vv) sin V3 
SS (5.4.9) 


sin 7(v3 — v}) sin V2 


then, after the integrations, we find 


= le? Aner In(l — x rn . 
Aicap ~ K| ( + —) ea =<) ae (5.4.10) 
0 
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It is easy checked that this, in turn, is precisely the derivative of the Born term 
with respect to the slope. Thus 


a 
(a’) aa’ tree: 


Notice that we can absorb this divergence into a renormalization of the slope. 
This is our desired result, that a redefinition of the Regge slope can render the 
Type I superstring theory renormalizable at the first loop. 

Now, let us consider the case of the closed string amplitude, where even 
more surprises take place. 


(5.4.11) 


A loop ~ 


5.5 Closed Loops 


It is again a straightforward process to calculate the single-loop amplitude of 
the closed bosonic string. Let us stress the differences: 


(1) The world sheet of the string, which was topologically equivalent to the 
upper half-plane for the open string case, now becomes the entire complex 
plane. 

(2) The propagator must contain an integration over o so that it is independent 
of the origin of o space. 

(3) We must sum over different orderings of the vertex functions. 

(4) The external lines, which were once attached to the boundary of the strip, 
are not attached to the interior of the complex surface. 


In Fig. 5.8, we see the horizontal strip defined from 0 = 0 too = 27, such 
that the top and bottom edges are identified with each other. For the single 
loop, we must now also identify the left and right edges. By the exponential 
mapping, we then map the finite horizontal strip to the entire complex plane. 

By the usual coherent state methods, we find [20] (taking the slope a’ = ‘) 


i = [ arptevioy. suse Vy D) 


» 2a (1/2)ki-k 
_ med —48 NaS. 5.5.1 
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i<j 
where 
v; = (2mi)' Inzz2°++z;, 
Vii = Vy — Vi, 
T= vy = (277i) 'Inw, (55:2) 
Us 92127 EN 
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FIGURE 5.8. Conformal surface for the single-loop closed string diagram. In the p 
plane, the surface is a rectangle of width 27 and arbitrary length, with the opposite 
edges identified. In the z-plane, the surface corresponds to a doughnut. External legs 
can attach themselves to any point within the surface. 


and 
(gle cin = Ee pitied (| — itil) 
w= ale ee 
ew) oP Din fw os TI (l- w"y 
Xiy = X(Cji, w) 
— ee 2 _ Ju. 
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Let us rewrite this as 
A= f @cam t) °C(t) F(t), (5.5.4) 


where 
C(t) = 4G ing) re es) 
N~1 
tty = mw’ Im rf al a V; iG 
i i<j 
Notice that the integrand is doubly periodic: 
Xv 1, et) = Xr 7). (5.5.5) 
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This is easy to see because the theta function has the following properties: 
O(v+1]t)=O(v{ 7), 
Ovtr | tr) = —e 7M Oi (v | 7). . (5.5.6) 


This is important, because it shows that a naive integration over the v vari- 
ables will drastically overcount the proper region of integration. Consider the 
parallelogram formed by the origin and the points 0, 1, t, and 1 + t. When 
opposite sides are identified, this becomes topologically equivalent to a torus 
or doughnut. Notice that the double periodicity divides up the complex plane 
into an infinite number of these parallelograms. Thus, we want to integrate 
over only one parallelogram, or else we will have infinite overcounting. Thus, 
for fixed t, we must restrict the integration over the v variables or else we will 
be integrating over an infinite number of copies of the same thing. We will 
choose the following truncation: 


0 < Imp; < Imr, 


—} <Rev; < }. (55,7) 


Surprisingly, in addition to this truncation in v space, we must perform yet 
another truncation in t space. The integrand of the closed-loop diagram is 
actually invariant under yet another transformation, given by 
, at+b 
tT = ——., 
Coad 

where a, b, c, d are all integers and ad — bc = 1. This generates what is called 
the modular group SL(2, Z). Let us show that the integral is invariant under 
this transformation: 


(5:5,8) 


ome (6m d\n dc 
Imt — |ct + d|7 Imt. (5.5.9) 
Therefore the following are actually invariant under this modular transforma- 
tion: 
Ce 
Im?t Im?z 
Now let us calculate how the other terms transform under a modular 


transformation: 
RR aoe pa \ct sed ae me ett (505.11) 


Thus, 


(5.5.10) 


C(t) = C(@). (5.5.12) 


’ Finally, we also have 


v at+b = 
eee oe eer td D1): (5.5203) 
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Thus, 
Fiz) = FO): (5.5.14) 


The integrand is therefore invariant under a modular transformation, which 
was first pointed out by Shapiro [20]. But what is the intuitive meaning of this 
symmetry? 

The Polyakov action tells us that we must integrate over all conformally 
inequivalent surfaces. We first observe that two parallelograms with different 
values of t are conformally inequivalent when we identify opposite sides. 
Thus, naively we expect that the integration over tT automatically integrates 
over all conformally inequivalent surfaces. However, this is not the case. 

There are actually two kinds of reparametrizations of a surface that must be 
carefully distinguished. The first are the reparametrizations that can smoothly 
be deformed back to the identity, i.e., the set of smooth reparametrizations 
that contains the identity map. The second is the set of reparametrizations that 
cannot be smoothly deformed back to the identity. Global diffeomorphisms are 
of this category. For example, take the parallelogram and identify only one set 
of opposite sides. This creates a tube. Normally, we would bring the two ends 
of the tube together to make a torus. However, now twist one of the open ends 
of the tube by 27 and then resew the ends together. By examing the resulting 
surface, we find that this has caused a genuine reparametrization of the surface, 
but that the identity map cannot be represented in this way. This twist is called 
a “Dehn twist” and it generates a discrete group. For the torus, it can be shown 
that the group generated by Dehn twists is the modular group Sp(2. Z). 

Concretely, the integral is invariant under t ~ —1/t and t > t + 1. By 
successive applications of these two transformations, we can show that we 
generate the entire modular group. But carefully examining the effect of these 
two transformations shows that they simply interchange the boundary of the 
parallelogram, generating Dehn twists. 

In summary, this second symmetry, called modular invariance, results from 
the fact that we must gauge-fix not only reparametrizations that can smoothly 
reach the identity map but also global diffeomorphisms that are not connected 
to the identity map. Thus, we must divide up the complex t-plane so that we 
only integrate over one surface invariant under the transformations T— — t+] 
and t -» —I1/t. Thus, the complex t-plane is divided up into an infinite set 
of redundant copies. To eliminate this infinite overcounting, we will take the 
following fundamental region of integration: 


—4<Rer< 


fundamental region = { Imr > 0, (55.15) 
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FIGURE 5.9. Fundamental region for the single-loop closed string amplitude. Mod- 
ular invariance of the amplitude divides the complex plane into an infinite number of 
equivalent regions. Thus, we must choose only one such region, or else the amplitude 
is infinite. The most convenient region lies between Ret = —} and +4 with |r| 
being greater than one. 


We will shortly see that modular invariance is perhaps one of the most 
powerful tools we have in checking for the self-consistency of new string 
compactifications. 

As we said, the effect of a modular transformation is to perform a Dehn 
twist, 1.e., shuffle the boundary conditions on the parallelogram. For example, 
if we have a string defined on the parallelogram labeled by X(o), 0), then a 
modular transformation changes the boundary conditions by 


X (0), 02) > X(ao; + bon, co; + dor). (5.5.16) 


Specifically, we can check that the transformations tT > t+1 andt > —1/t 
change the boundary conditions in the following fashion: 


le eee —> X(o; + 0,0»), 


5,511 7/ 
X(Gy,07) > ~~ X(Gy, 04). ( ) 


t—> -l/t 
Now let us analyze the divergence structure of the closed string amplitude. 
We first note that 


x(v, T) > 2z|v] (3313) 


as v —> 0. Thus, poles occur, as expected, in the amplitude when external lines 
coincide. The poles occur at s = —8, 0, 8, 16, etc. This divergence can be 
interpreted as coming from the self-energy diagram on the external leg, which, 
unfortunately, is on mass shell. 

Now let us generalize this calculation to the Type II superstring, which can 
easily be evaluated using the same techniques. The vertices all have the form 


W=VV, (5.5.19) 


where the V’s correspond to open string vertices (with half the momentum) 
of the two sets of oscillators. Once again, the two and three point loops vanish 
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because of the trace over the So modes. The trace can be evaluated exactly as 
in the open string case, except we now have double the number of oscillators. 
The answer for the boson scattering amplitude is 


Ayn =e / d’t(Imt) °F s(t), (5.5.20) 
Ee 
where 
3 
Fs(t) = (mt)? / [[@u | (Gea (5.5.21) 
f=1 Tag 


Notice that the factor C(t) is missing and that the power of Im t has changed. 
The remarkable thing is that this amplitude is totally finite! This finiteness is 
due to several factors: 


(1) The absence of two and three point amplitudes makes it impossible to 
place tadpole or self-energy insertions on the external lines. Thus, we do 
not have the poles found earlier for the closed bosonic string. 

(2) We can reduce the divergence of the graph by taking a fundamental domain 
free of divergences. 

(3) There are no contributions from tachyons vanishing into the vacuum, since 
there are no tachyons. 

(4) Fermion internal lines cancel with boson internal lines to reduce the 
divergence of the graph. 


5.6 Multiloop Amplitudes 


The multiloop function can also be written explicitly in terms of path integrals 
over Riemann surfaces with holes. The main problem with constructing these 
amplitudes is the choice of parametrization of the Riemann surface. Four types 
of parametrizations have been developed for the multiloop amplitudes: 


(1) Schottky groups. Multiloop amplitudes, which were originally calculated 
in this formalism [2-8], are discussed in this section. There are several 
advantages to this parametrization of a Riemann surface. First, it is explicit. 
There is no guesswork in the choice of integration variables, which are 
known exactly. The integration variables, in fact, are intuitively related to 
the topological structure of the Riemann surface. Second, the amplitudes 
factorize (because this representation was originally calculated by sewing 
together multiresonance vertex functions). Thus, unitarity can be shown. 
The disadvantage of this formalism, like other formalisms, is that modular 
invariance is not obvious at all. The integration region must be truncated 
by hand. 

Constant curvature metrics. The formalism, which will be discussed in 
the next section, is based on Riemann surfaces with constant Gaussian 


@ 


— 
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curvature. The advantage of this formalism is that it arises naturally when 
quantizing the Polyakov action. The other advantage is that a consider- 
able mathematical literature exists for Riemann surfaces with constant 
curvature. The disadvantage of this approach, as in the Schottky repre- 
sentation, is that modular invariance of the higher loop amplitudes is still 
obscure. The integration region must be truncated by hand. Unlike the 
Schottky representation, however, explicit representations of the 6N — 6 
modular parameters for arbitrary surfaces with constant metrics are rare. 
Furthermore, because this formalism is not derived by sewing together 
three-resonance vertex functions, factorization and hence unitarity are not 
obvious. 

(3) Theta functions. This is perhaps the most natural formalism, because 
modular invariance is built-in from the very start. We will discuss it in 
Section 5.11. This method is based on generalizing the theta functions 
introduced in (5.2.11) for the single-loop amplitude to include functions 
which are quasi-periodic in several variabies. The natural integration vari- 
able is the period matrix 2;; itself, which can be defined for any Riemann 
surface. This formalism can also be easily extended to include theta func- 
tions defined on various spin structures. Although this formalism holds 
much promise and is the subject of much research, there are also severe 
drawbacks. For example, beyond three loops, the period matrix becomes 
an extremely awkward method of parametrizing moduli space. (This dif- 
ficulty of parametrizing moduli space by the period matrix beyond three 
loops is called the “Schottky problem.” Only recently have mathematicians 
solved this problem. Unfortunately, the solution is highly nonlinear, and 
much more work has to be done to develop the formalism beyond three 
loops.) As in the previous formalism, factorization (and hence unitarity) 
are obscure. Higher loop theta functions are constructed by making edu- 
cated guesses and appealing to the uniqueness of the final result, not by 
sewing together vertex functions. 

(4) Light cone formalism. Because the light cone method is based strictly on 
physical variables, without any ghosts, this formalism is manifestly unitary 
and hence we intuitively expects that it automatically provides one cover 
of moduli space. This conjecture, which was never considered before by 
mathematicians in their study of moduli space, has recently been proven 
to all orders. We will only briefly discuss the light cone formalism at the 
end of this chapter because this formalism will be further developed in 
the next chapter in the context of developing the field theory of strings. 
The advantage of the light cone formalism is that it is manifestly unitary, 
factorizable, modular invariant, and easily generalized to a genuine second 
quantized field theory. The disadvantage is that it is obviously gauge-fixed. 


’ All four formalisms, of course, must eventually yield equivalent results. 
Let us first discuss the Schottky groups and rewrite the single-loop amplitude 
in a form that can most easily be generated to the multiloop case. (We will 
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discuss the theta function method in Section 5.11). Let us define a projective 
transformation (2.7.1) that forms the group SL(2, R), ina way that emphasizes 
its geometric properties. Let us define the invariant points x, and x of the 
projective transformations to be the points that are left invariant under the 
transformation: 

P(x) =%1, 
POG) =a 


Then we can rewrite an arbitrary projection transformation, which has three 
arbitrary parameters, in terms of the two invariant points and the mu/tiplier X: 


2(X2 — Xx}) — x1 x2(1 — X) 


invariant points: | (5.6.1) 


Pizy= (5.6.2) 
@) z(l— X)+%2X — x, 
Another convenient form for the projective transformation is 
P(z)— x1 eax 


The advantage of writing the projective transformation in terms of the multi- 
plier is that products of projective transformations have simple multipliers: 


(Xp)" = X(pr, 
Xpo = Xgp. (5.6.4) 
(Xp)! — X(p)-!. 


Under a projective transformation, any P can be brought to the form 


P viet (5.6.5) 
0 vet} 5.6. 


Notice that the trace of P can be written as 


1 
Tr P(z) = VX + ra (5.6.6) 
This, in turn, allows us to define “conjugacy classes.” Two projective transfor- 
mations P; and P, belong to the same conjugacy class if they have the same 
multiplier. 
Depending on the multiplier, we can define several types of projective 
transformations: 


(1) P is hyperbolic is X is real, positive, and not equal to one. 
(2) P is parabolic if X is real and equals one. 

(3) P is elliptic if |X| equals one and X does not equal one. 
(4) P is loxodromic if X is complex and none of the above. 


For a real projective transformation, the multiplier can be greater than or 
less than one, depending on whether x; < x. or vice versa. We thus have the 
freedom to choose all our projective transformations to be hyperbolic, so the 
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multiplier is less than one. For the single loop, we now recognize the projective 
transformation to be 


P(z) = we 


Jw 0 
P= 1 : (5.6.7) 


Thus, the multiplier of the single-loop transformation is simply w itself. 

Notice that we have 3N parameters associated with N projective transfor- 
mations. However, we noted earlier that three points along the real axis can 
always be fixed because of projective invariance. Thus, the open string N-loop 
amplitude will have 3N — 3 parameters that describe the surface. These are 
called Zeichmiiller parameters and are the minimum number of parameters 
needed to characterize inequivalent Riemann surfaces with a boundary. For 
the closed string, we have spheres and complex projective operators, so we 
have 6N — 6 parameters. In summary: 


closed string: 6N — 6, 


open string: 3N —3. (5.6.8) 


Teichmiiller parameters: | 


(Intuitively, we may say that it takes two parameters to locate the position of the 
center of a hole and one parameter to label its radius. Thus, 3N parameters are 
needed to describe a surface with N holes. We must subtract three to eliminate 
conformally equivalent ways of putting the external lines on the real axis. This 
makes a total of 3N — 3 parameters.) 

Let us now rewrite the integrand of the single-loop function in terms of 
manifestly invariant functions. From (5.6.4), we note that the partition function 
can be rewritten as 


: [a —w")= [Jd -(xe)") =] [a — Xp). (5.6.9) 
=) ie i 


Thus, the partition function is now written in terms of the set of projective 


transformations P”. 
Second, we can rewrite the momentum integrand in terms of projective 
transformations. Before, we showed that the combination 


17 Uh (Cea) ae (5.6.10) 
Lj 


transformed nicely under a projective transformation because of momentum 
conservation and the fact that the external lines are on-shell. Now, we want to 
show that the single loop can be written as an invariant integrand: 


du |] @—w'z)a=au [] @—Priyrea! (5.6.11) 


p= peal n=lLi<j 
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FIGURE 5.10. Action of a projective transformation. Under successive applications 
of the same projective transformation, circles surrounding one invariant point map 
into circles surrounding the other invariant point. After an infinite number of pro- 
jective transformations, a point on one of these circles comes arbitrarily close to the 
invariant point. 


Thus, the single-loop integrand can be rewritten as 


fan [ [@ — P"zyeh(L— Xpry 4A. (5.6.12) 
n=l 
i<j 
It is crucial that we identify the region of the complex plane that we are 
integrating over. In Fig. 5.10 we see how the upper half of the complex plane 
is divided into equivalent sectors by the action of the operator P. Notice that 
a projective transformation maps circles into circles. Thus. a point near one 
invariant point, after being repeatedly hit with P, gradually migrates over to 
the other invariant point. In the process, the upper half-plane is sliced up into 
disjoint regions. We can take any of these disjoint regions for our integral. In 
the figure, we have taken the region that is midway between the two invariant 
points. The arrows show how a point migrates under P, 1.e., we see the iden- 
tification of the points along the surface. When we add in external lines, they 
lie only on the x-axis and can only move up to the edge of these circles. 
This procedure easily generates to the multiloop amplitude. We demand: 


(1) Projective invariance under SL(2. R) transformations. There should be N 
projective operators for the N-loop amplitude. 

(2) Modular invariance for the closed string, so we integrate over one 
fundamental domain in parameter space. 


From these invariance arguments alone, we can almost write the unique 
multiloop amplitude. 
First, let us write the multiloop divergence. Let 


Ls eas aa (5.6.13) 


represent N real projective transformations, such that their invariant points are 
arranged sequentially on the real axis. Let { P} represent the set of all possible 
distinct products of the various P transformations, raised to any positive or 
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FIGURE 5.11. Schottky representation for the double-loop open string diagram. 
There are two sets of invariant points lying on the real axis, corresponding to each 
of the two projective transformations. The circles surrounding the invariant points 
are to be identified with each other. 


negative power. (In Fig. 5.11 we see that the invariant points are all aligned on 
the real axis.) The set {P} forms a group. Our region of integration will be the 
region of the real axis that exists between the limit points of the elements of 
{P}. If the limit points of all the elements of {P} fills up the entire complex 
plane, then this is uninteresting. Then there is no region of integration for our 
variables. 

Our interest lies in groups {P} whose limit points form a discrete set on the 
real axis, so there is a finite region of integration. Groups with this property 
are called Schottky groups. 

Let {P} represent all distinct products of these transformations modulo 
cyclic permutations. Then the divergent term for the N-loop amplitude is a 


product over the set {P} [4, 5]: 
| [a - Xa) (5.6.14) 
P) 


and the momentum-dependent term is - by a product over the set { P}: 


IT ce je. (5.6.15) 


i<j.{ 


The integrand for the N cn amplitude is therefore surprisingly easy to 
write. The only difficulty is determining the region of integration such that 
we preserve projective invariance. The final formula for the N-loop amplitude 
with M external tachyon legs is [2-8] 


ie / [] dp{ [| [a - Xa) 
a=! {P} (P} 
N 


x TP Gi trhegt TT] axexgt™ 


l=i<j=M B,A=1 
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k, ky 
‘S I] oO) 
fe) et ees 


i—] 


2) y G/2)k pack, 
a eel (5.6.16) 
ee ~ ee _ eee 
where 
M N 
d= | aaa elen en Ct ee (5.6.17) 
i=l a=1 


where the Roman letters i and j represent the external tachyon lines, and Greek 
letters represent only the loop variables. Each projective transformation P, has 
two invariant points and a multiplier: 


eee ar oe (5.6.18) 


The product over {P} is taken over all inequivalent products of the projective 
operators P,. (We exclude those products where there is overcounting: 1.¢., ifa 
member of { P} isa product of transformations that ends with the transformation 
P,, then it cannot be allowed to operate on the invariant point of P,. or else 
we will overcount.) 

The essential aspect of this integrand is the region of integration, which must 
count each conformally inequivalent configuration once and only once. 

On the real line, we can always make a projective transformation such that 
all the external lines are bunched together on one side and all the invariant 
points are bunched together on the other side. (If a point < lies to the left 
of the invariant points of a projective transformation P,, then P,(<) moves 
z across its invariant points to the other side. Thus, by successive projective 
transformations we can shove all external lines to one side of the real axis 
and all the invariant points to the other side.) We must be careful. however, to 
exclude the possibility of a projective transformation that whips a point all the 
way around the diagram. The transformation 

N 
P= ie (5.6.19) 
a=] 
has the property that it moves points all the way around the real axis past all 
the invariant points. Therefore, we must be careful to extract the periodicity 
that enters in by circling the entire real axis an arbitrary number of times. 


Let x" and x") represent the invariant points of this product. Then we have 
the limits of integration: 


( 


2 2 
x (1) (2) (1) Ee 


)< PQ) <<a < imate, ee 
(5.6.20) 

with the restriction that all multipliers for P,, including the product P, remain 

less than or equal to one. Of course, there are other equivalent choices for the 
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region of integration because we have taken a specific truncation of the region. 
In particular, we could have external lines placed between the invariant points 
of different projective operators. Because the set {P} was a Schottky group, 
we are guaranteed that there is a finite region of integration for our variables. 
(We must point out that the calculation of the multiloop amplitudes was done 
in 1970 using projection operators that eliminated the ghosts on the leading 
trajectory. With BRST ghost fields, the calculation can be redone to eliminate 
all possible ghosts. By the uniqueness of the Neumann function over a surface, 
we expect the same answer.) 

So far, we have discussed only open string multiloop amplitudes. Now, let 
us generalize these previous statements by actually writing down the Neumann 
function for a sphere with N handles or holes for the closed string sector [21]. 

Let us first consider the complex plane with 2N holes cut out. Call this 
region, which lies exterior to 2N holes, the region S. Let us call these pairs of 
holes a-cycles a; anda;,wherei = | to N. Projective transformations, we have 
seen, map circles into circles, so let us define N projective transformations P; 
which take us from one a-cycle into their partner (see Fig. 5.12) 


Pega (5.6.21) 


These projective transformations P;, of course, can in turn be parametrized by 
two invariant points 7 and oa and by the multiplier X;. We will find that 
the invariant points lie within each of the a-cycles, so they do not appear in 
the complex plane with holes cut out. In general, a point that lies within the 
surface § (the region exterior to all a-cycles) is mapped into the interior of one 
of the a-cycles by the action of a projective transformation P;. 

In addition to the a-cycles, we also have b-cycles, corresponding to cutting 
a line between a; and a;. The b-cycles are thus circular lines that encircle 
the ith hole when analyzing a sphere with N holes. Notice that the projective 
transformation z — wz in the single-loop diagram took us from the interior 
to the exterior radii of the annulus. Thus, the projective transformation moved 
us along the b-cycle. In general, the projective transformations P; will map 
points in one a-cycle into its partner, so that we are moving along a b-cycle. 

Let us now define V, to be the product of all distinct products of the various 
P;. Notice that i runs from | to N, while / runs over an infinite number of 
indices. Now let us define the function y: 


z— V;z'\(z’ — Viz 
In W(z, z' = Ine = 2')+ "Int rz) 12) (5.6.22) 
if 


z— V zz! — V;2') 


where the prime in the summation means that we include either V; or V;| but 
not both. Normally, we would expect to define the Neumann function as 


N(z, 2’) = In|w(z, z)I. (5.6.23) 


However, this function actually does not have the correct properties. We want 
to define automorphic functions that change only up to a constant when we go 
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FIGURE 5.12. Homology cycles of an arbitrary closed Riemann surface. The a and 
b cycles are 2g closed lines on the surface of genus g that cannot be continuously 
shrunk to a point. By cutting the surface along the a-cycles, we can flatten the surface 
into a plane with 2g holes cut out. Notice that the b-cycles are now lines that connect 
pairs of holes, which are defined by the a-cycles. 


around the various loops in the z-plane. The above expression does not have 
this periodicity property, which was the crucial criterion for the Neumann 
function. Thus, this function cannot be an automorphic function. 

The single-loop function, by contrast, did have this property. Notice that the 
function In z, for example, changes by 277i when moving around an a-cycle: 


In(ze?"") = Inz + 2vi (5.6.24) 
and by a factor of In w when going across a b-cycle: 
In wz = Inw + Inz. (5.6.25) 


These factors are called periods of the function Inc. The ratio of these two 
periods is 
In w 


——=T (5.6.26) 
201 
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which is called the period matrix. Thus, t is the period matrix for a single 
loop. 

In this way, we can show that the entire single-loop Neumann function 
changes only up to a constant when traveling around the a- or b-cycles. 

We can show that the previous N-loop candidate for the Neumann function 
is invariant when traveling around the rth a-cycle: 


Ov (Zz, 7) =0. (5.6.27) 


However, we can show that this function, when we travel around a b-cycle by 
making a transformation z > P,(z), is not invariant. It is not periodic because 


Sor In W(z, 2’) = —v,(z) + v,(z’) — int, + Six —In(c,z+d,), (5.6.28) 
where 
= zi) 
v(z) = Det ‘In = ayaa (5.6.29) 


(the summation symbol (r) simply means that, to avoid double counting in 
the combination Vig we must delete any P, that might be acting on the 
invariant point z''?), and c and d correspond to the factors found in the original 
definition of the projective transformation. When we take the function v,(z) 
and move z around the sth a- or b-cycles, we find 


bas Ur(Z) = 27i4;5, 


dps Ur (Z) = 2g i Gres (5.6.30) 
where 
thy V (1) z(2) =; (2) 
t= In Se E511 Mee) ec i. ied, (5.6.31) 
2mi (2? — Viz ya? — Vik?) 


The matrix t,,, which we get by taking v, around the sth b-cycle, is again 
called the period matrix, which is a natural generalization of the period matrix 
t we found earlier for the single-loop diagram in (5.5.2) and (5.6.25). 

With the generalized definition of the period matrix, we can write the com- 
plete Neumann function that has the correct periodicity properties. We modify 
w to read 


: 1 
In W(z, 2’) = Inw(z, 2) — 5 > Re(v,(z) — v-(z’))((im t)"),s 


x Re(u;(z) — v;(z’)), (5.6.32) 
so that the Neumann function becomes 
N(z, 2’) = In|W(z, z’)). (5.6.33) 


We can put everything together and write the N-loop amplitude for the 
closed string. The changes that occur from the single-loop amplitude are: 


210 5. Multiloops and Teichmiiller Spaces 


(1) The single-loop partition function now becomes 


I] ee eee (5.6.34) 
if 


where we take the multiplier of the product over all inequivalent conjugacy 
classes of V;. 

(2) The factor (In|w|)~', which was related to the period matrix, is now 
replaced by the determinant of the N-loop period matrix: 


det | Im r|,... (3:6.35) 
(3) Another factor of 
i =a (5.6.36) 


occurs in the amplitude. 


The final result is therefore [21] 


Qua aed x, aa 
AN-loop = ial i ee ia jae” = ei ‘ 
x Emel? PY — XT TG. zk, 6.6.37) 

I 


le 


where we integrate over the fundamental region F of the surface with V holes. 
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Although the previous calculation gave us an explicit formula for the multiloop 
amplitude in terms of Schottky groups, we also had to be careful in truncating 
the region of integration so that we avoid overcounting. This seems to be 
an inherent problem with the Nambu—Goto formalism, where the functional 
integral does not uniquely fix the region of integration. 

The Polyakov formalism, however, provides a way in which we can elim- 
inate the overcounting from the very beginning, using powerful theorems for 
Riemann surfaces. One of the greatest advantages of this formalism is that 
we can bring to bear the full force of the last century’s worth of mathemati- 
cal research on Riemann surfaces. Of special importance will be the fact that 
the determinant found earlier, which contains the singularity structure of the 
N-loop diagram, can be expressed in terms of the Selberg zeta function. 

Recall that the Polyakov action is given by (2.1.35) 


| 2 
= = | aezn/ eee Oak Oy X Gia) 
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The generating functional is 


l= Si / Dean / Dxee- G72) 
metric embeddings 


topologies 


where we must be careful to divide out Riemann surfaces that are equivalent to 
others by a conformal transformation. We will follow the derivation of Alvarez 
[22] for closed strings. 

The functional integration over the string variable X is a Gaussian and hence 
easy to perform (see (1.7.10)): 


—(1/2)D 
[ Pxexp (— f eaveea.x"dx,.) = == det'(— v4) 
G73) 
where 
—1 
V? = —=8n/og™" On, (5.7.4) 


J8 
where the prime on the determinant will always mean deleting the zero mode. 

The partition function (5.2.14), which contained the divergence of the single 
loop, comes out of this determinant. 

However, the functional integration over the metric is considerably more 
involved because of the presence of gauge parameters and also the presence 
of loops. As we saw in (2.4.1), the measure is invariant under 2D general 
covariance and rescaling: 


S8ab = Bacd,5V° + Og5V° Beh — Oc5V° Bab + 250 Lap. (5.7.5) 
If we add in the Christoffel symbols, we can rewrite this expression covariantly: 
b2ab => V,6Up =p VpdV, aia 250 Bab, (5.7.6) 


where o parametrizes a Wey] rescaling, and dv, parametrizes a reparametriza- 
tion of the two-dimensional surface. In general, this means that the measure of 
integration over g,» is actually infinite. Notice that g,,, has three independent 
components and that dv, and do also have three components, so that naively 
we expect that we can set all the components of the metric to the delta function: 


Sab = Sab. GLy) 


Naively, choosing the conformal gauge is equivalent to factoring out the inte- 
gration over the infinite volume due to Wey] rescalings and to two-dimensional 
reparametrizations over the surface. Gauge fixing thus means replacing the 
functional integration over metrics with 


Dgar > D8av(Qnitr) '(Qwey) |; (5.7.8) 


where the infinite volume of the space can be represented as (see (1.6.7)) 


Qin = f Drs 
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Wey! = | bo. (5.7.9) 


For spheres and disks without holes, this is actually true. However, for 
surfaces with larger numbers of loops, or handles, this is no longer true because 
of complications due to the parametrization of the loops. 

In general, a sphere with N holes or handles requires a number of param- 
eters to describe the location and size of each hole. Basically, for a disk with 
holes we need one parameter to label the radius of each hole and two more 
parameters to give us the coordinates of the center of each hole. Thus, we need 
3N parameters to describe N holes. (For a sphere with handles, we need 3N 
complex parameters to describe N pairs of holes.) As we saw earlier, three 
parameters can be fixed overall (and set equal, for example, to 0, 1, and oo), 
so a disk with N holes can be described by 


3N -3 (5.7.10) 


real parameters, or double that number if the surface is a sphere with N handles. 
Thus, g,, cannot be set equal to 54, for surfaces with higher genus (holes). 
However, any two-dimensional metric is conformally equivalent to a metric 
of constant curvature. By a conformal transformation, we can always set the 
curvature to a constant. Thus, the space of metrics over which we want to 
integrate is the space of constant curvature metrics divided out by the diffeo- 
morphisms on the surface M. Moduli space is the space of constant curvature 
metrics when we eliminate the overcounting arising from reparametrization 
invariance: 


. Meonst 

moduli space = Diff)” (S701) 
However, as we saw in the single-loop discussion in Section 5.5, there are 
actually two kinds of reparametrizations, those that can be connected to the 
identity and those that cannot. In the previous identity, we divided out by the 
set of all diffeomorphisms of the surface. Thus, we also have the possibility of 
dividing out by the diffeomorphisms Diffy(A/) that are connected only to the 
identity. The resulting space is called Teichmiiller space 


Meonst 
Diffo(M)- 
The relationship between moduli space and Teichmiiller space, of course, is 


necessarily very close. In fact, they are actually equivalent up to the action of 
a discrete group, called the mapping class group: 


Teichmiller space = (S725 


eine Teichmiller space 5.7.13 
pace = MCG : (S71 
Thus 
Diff(M 
MCG = wa) (5.7.14) 


Diffo(M) 
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FIGURE 5.13. Action of a Dehn twist. The torus has been cut along its a-cycle. One 
end has been twisted one full turn and then rejoined. Notice that the original b-cycle 
has now become the sum of an a- and a b-cycle. This mapping of the torus into itself 
cannot be continuously deformed back to the identity map. The set of all Dehn twists 
generates the mapping class group. 


In other words, the only differences between Teichmiiller space and moduli 
space are global diffeomorphisms that cannot be connected to the identity. For 
example, think of cutting a torus as in Fig. 5.13, twisting one of the sliced 
edges by 277, and then reconnecting the two edges. This is called a Dehn twist. 
Notice that this transformation generates a diffeomorphism that cannot reach 
the identity. It is a global diffeomorphism. Thus, Teichmiiller space is “larger” 
than moduli space because you have to divide out the global diffeomorphisms, 
or Dehn twists, to arrive at moduli space. The dimensions of both these spaces 
are given by 


0, if N = 0, 
dim Teichmiiller = dim moduli = { 2, itn = 1. oy Filea) 
6N — 6, if N= 2. 


This, of course, is precisely the number of parameters necessary to describe 
a sphere with N holes. Thus, this description gives us the necessary param- 
eters to parametrize the N-loop diagram. (We note that the modular group 
and the mapping class group are identical, and we will use these two terms 
interchangeably.) 

In practice, the volume factor due to the mapping class group can be factored 
out trivially, so that, for our purposes, we may treat Teichmiller space and 
moduli space as essentially the same. 

So far, the discussion has been general. To actually extract out these extra 
parameters that describe the holes of a Riemann surface, we need to use the 
theory of Teichmiller spaces. 

_ Some of the manipulations are a bit involved, so we must always clearly 
have our goal in mind, which is to rewrite the functional measure Dg,,, in 
terms of the parameters Dv, Do and the 3N — 3 Teichmiller parameters Dt;. 
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Thus, our goal is to establish 
objective: Dg,, = uDv,Do Di;. (5.7.16) 


Then, by simply dividing out by Du, and Do, we will have successfully 
eliminated the infinite redundancy introduced by reparametrization and scale 
invariance. 

Let us now rewrite the variation of the metric tensor (5.7.6) in a more con- 
venient form, revealing the fact that g,,, is also a function of f;, the Teichmiller 
parameters. Using the chain rule, we can formally represent the dependence 
of the metric on the Teichmiiller parameters via 51;5/5t': 


b8ah = [Vudu + Vp8Va = (V.d0°) gap | 2h (V.dU°) Ban tr 260 Bab si ét'T', 


where 


yi — 282 _ (trace), (5.7.17) 
at! 
where we have explicitly taken out the variation of the metric as a function of 
the 3N — 3 parameters (Teichmuller parameters) that we label r' associated 
with the N loops. Notice that we have subtracted out the trace in the brackets 
and that, at this point, we do not have to specify precisely how the metric 
depends on the various ¢;. 
This variation can be rewritten simply as 


dab = P (5V)ap a5 266 Bab Sie 51' 9; Rap, 71S) 


where 
Py (dV) ap = V,5Up + V,dU, = Cap cove: (5.7.19) 


The operator P; plays a crucial role in the theory of Teichmiiller spaces. Notice 
that it is an elliptic operator that maps vectors into traceless symmetric tensors. 
Let ker P; represent the kernel of the operator, i.e., the set of vectors that are 
mapped to zero by the operator. Ker P; is called the set of “conformal Killing 
vectors.” For surfaces with genus N, the dimension of the kernel of the operator 
is given by 

dim ker P; = 6 for genus 0, 

dim ker P;} = 2 for genus 1, (5.7.20) 


dimker P} = 0 for higher genus . 


We also wish to define the adjoint of P,, which we will call le To do this, 
of course, we first have to define how to take the inner product. Let us define 


Sean? = | dz, [B 0% 2" Seay eoa, (5.7.21) 


|Svepll? = | d?z./e9q5v%5v". (S022) 
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Once we have defined a scalar product, this allows us to define the adjoint P; 
via the definition (a| Pb) = (Ptalb): 


Pi (6g)a = —2V"Sgap. (5.7.23) 


Let us now analyze the space ker Py, i.e., the space of vectors that are annihi- 
lated by Py If we rewrite everything in terms of z and Z, then the elements of 
ker P;' satisfies 


V'6e,. = 0:82,, = 0. (5.7.24) 


The kernel of lay is spanned by what are called quadratic differentials. For- 
tunately, the dimensions of the spaces of quadratic differentials for Riemann 
surfaces are known: 


dim ker P' = 0 for genus 0, 
dim ker P/ = 2 for genus 1, (67725) 
dim ker P' = 6N —6 for genus N. 


Thus, the number of parameters within the kernel of jee is equal to the number of 
Teichmiiller parameters needed to describe N loops or handles. Symbolically, 
we can summarize how to divide the measure of integration into its constituent 
parts as 


(5gap} = {50} ® {P5v,} @ {ker P/}. (5.7.26) 


This equation has a rather simple meaning. It says that the components within 
g.,» can be broken up into three parts, a dilation part, a traceless part, and also 
the hidden Teichmiller parameters. It also means that we are very close to 
attaining our goal, (5.7.16), but there are some subtle complications. 

Let us explicitly make a change of variables and calculate the Jacobian of the 
transformation. Let us first assume that there are no Teichmiller parameters 
to worry about. Then we make a change of variables from h,» (which is the 
traceless part of the metric) and t (which is the trace of g,,,) to du, and o: 


a(t, h) 
a(o, v) 


Dea» = det Do Diy, (5.7.27) 


where 
a(t, h) 1 X a 

det det = det P, = [det P, P;]'”, S28) 

et | | E P, i=l iP; ] ( 

where the value of X is arbitrary, since it drops out of the determinant. 
Notice that the square root of the determinant of P; Pi is just the Faddeev— 
Popov determinant for the conformal gauge, which in ane can be written in 
terms of the BRST ghosts. Thus, we can write the Faddeev—Popov determinant 

1 first found in (2.4.3) as 


det!/? P,P! = App = | DbDce~***. (5.7.29) 
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Next, we wish to calculate the Jacobian factor due to the fact that the mea- 
sure actually depends on the Teichmiiller parameters t;. The problem is that 
the Teichmiiller parameters are not orthogonal to P;du,, so the Jacobian will 
contain cross terms that have to be factored out. 

Let us begin our discussion by introducing a set of 3N — 3 complex fields 
yw“ that will be an orthogonal basis of ker P,’. Let us now decompose the factor 
[22,231 


: O02, 
i= — — (trace) (5.7230) 
into the following identity: 
l 
Ti=(1-P,——P | Ti+ Pj——pyT'. (Soe 
PP, ee 


It is important to note that we have done nothing. We have only added and 
subtracted the same term. However, the term that we have introduced contains 
the operator P; acting on another state. Thus, the above identity is useful 
because it allows us to extract the piece of T' that lies along the direction of 
Pe 


fee a Pie (5.732) 
where 
: 1 ; 
v= —— PIT’, (5.7.33) 
(Ww, T') = / dafeeo ei. (5.7.34) 


As expected, there is a piece of T' that lies along the direction P; that must be 
accounted for when we write the Jacobian. Let us now insert this expression 
back into the measure for 5g,,: 


Sguvll” = I|5o || aa One WoV"Ya. vr) an’, PP aiat, a 1 Prdvy ll. (Sie) 


This, finally, gives us the Jacobian that includes the contribution from the 
Teichmiuller parameters. We have also explicitly taken into account the fact 
that 7’ originally contained a piece that lay in the direction of P,: 


det(W., T) 
Dgup = Do Dv, Dt; det'?(P} pea 
det “Wha, Wp) 
This, in turn, allows us to put the entire functional integration together into 
the following factor: 


/ Die Digan Q5i¢2Weyi = | Dv Q5ie Qyeyi do 


(5:7.36) 


det(y?, T') 


x Dede! ?P CR Ep 
( 1 LIES 
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—(1/2)D 
(<2; Fez Ge a v")) e037) 


We have finally attained our goal, (5.7.16), up to the question of anomalies 
that may spoil scale invariance. 

Although this discussion might appear long and difficult, the end result is 
quite simple. The final answer shows that the measure of integration, including 
the Faddeev—Popov ghost determinant, can be written totally in terms of deter- 
minants of the operator P; and its adjoint. There are three parts to the measure: 
(1) first is the Faddeev—Popov term, which is written as the square root of the 
determinant of P, Pe ; (2) the second is the term involving T', which arises 
because the moduli parameters t; are not ee to the basis vectors 
w® which form an orthogonal basis for ker P;'; and (3) the third factor is the 
determinant of the Laplacian, which we will aes show can be written in terms 
of the operator P;. 

However, it is not yet possible to remove the integration over the scalar 
parameter o in (5.7.37). Unfortunately, the various measure terms contain the 
scalar parameter. We wiill find, in fact, that in general the scale factor cannot 
be eliminated from the measure terms at all, breaking conformal invariance. 
In the next section, we will show that there is an obstruction, an anomaly, to 
scale invariance, called the conformal anomaly, which can be removed only if 
the dimension of space-time is 26. This fixes the dimension of spacetime. 

In the process, we will show how to write all determinants totally in terms 
of the P; operator. 


5.8 Conformal Anomaly 


We note that we can cancel the integration over the volume of the 
reparametrization group, because 


ef Dv, = 1. (5.8.1) 


Next, we wish to eliminate the Weyl rescaling term: 


om | Dov= 1. (5.8.2) 


However, the integration over the o rescaling term is considerably more com- 
plicated, because o terms exist in the other factors of the measure as well. 
Thus, we must very carefully extract all o-dependent terms from each of the 
other factors before we can eliminate the rescaling term. It will turn out that 
the Wey] factor can also be factored out of the measure, but only if the dimen- 
sion of spacetime is 26! The calculation of the conformal anomaly is rather 
involved, so we will only sketch the highlights of the calculation. 
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First, we note that the term (T', 7“) is already Wey] invariant. This is because 
if we rescale according to 


Lab = @ gan, 
Ti =eT', (5.8.3) 


we notice that by the very definition of (T', y“), this term remains invariant. 
(The w variable remains unchanged under a Weyl rescaling.) 
Thus, the only term we have to worry about is 


det’ P'P, |” 2 
det(w“, wy”) | Cae 


(The prime still means that we have extracted out the zero mode of the 
determinant.) 

Fortunately, the extraction of the Weyl rescaling parameter can be performed 
simultaneously on both terms if we use a few facts about Riemann surfaces. 
The trick is to rewrite P,, ae and V? in a way such that all three operators can 
be expressed in terms of the same differential operator. 

Let 


(5.8.4) 


ijlkmn... 
Pease (5.8.5) 
represent an arbitrary tensor in the two-dimensional Riemann surface. Of 


course, we can always rewrite this tensor in terms of complex coordinates 
z and z. In these coordinates, we denote as K” the set of all tensors 7 that 


transforms as 
dz’ \" 
Ke {rit > ( ) r| (5.8.6) 


od 


~ 


(e.g., see (2.7.6) and (4.1.7)). Clearly, the operator 


VIF =(g*)*0, [Gey F | (5.8.7) 
maps a tensor T into another tensor. More precisely, it maps KY“ into A% !: 
(Ria Ka (5.8.8) 


It is also possible to define an operator that acts in the opposite direction. The 
operator 


Vil = 87021 (5.8.9) 
maps K? into K9*!: 
Wak? SR 7). (5.8.10) 
Now let us define the operators 


Veo 
ee ey (5.8.11) 


5.8 Conformal Anomaly 219 


and 
ae! 0 
pt- é 
( 0 ge i (5.8.12) 
q 
Using this notation, we can show 
VeVi 0 
=—PiP= | | 5.8.13 
be ( 0 Vi,Vvz! ne) 


Thus, 
det P/ P; = det(V?V;) det(V2,Vz') 
= det Aj det A_,. (5.8.14) 
Since the + Laplacian is the complex conjugate of the — Laplacian, we have 
det!” P' P, = det At. (5.8.15) 
Finally, we can also define 
Aj = ViV?. (5.8.16) 


Armed with these definitions, we have now expressed the determinant in the 
form 


det P'P, | i _ det A} 
det(y7, wr?) {[@ofg ~ det!/(we, w) 
(5.8.17) 


In other words, both pieces of the Jacobian, which at first appear dissimilar, 
are now represented by the same operator ee for gq = | and for g = 0. Now, 
it remains to calculate the variation of the previous expression when we do a 
Wey] rescaling. It is convenient to write the equation in terms of the heat kernel 
of the determinant: 


(det Ajym a=. 


Tat 
log det H = -| ze Trees (5.8.18) 


where € is a small number. Using this equation, we can now express the varia- 
tion of the determinant due to a Wey] rescaling. The actual calculation is quite 
involved, so we will only quote the final result [22—24]: 


det AZ 1+ 6q(1 + q) 
6 log REM ¥) 7s (ae 


where R is the contracted curvature tensor in two dimensions. Thus, we want 
to calculate this factor when gq = 1 and g = 0. The final result is 


Stoo | ct Pi — 
oO a, | 
S| det 2(ye, wr) \ FPzve 


d*z./gRéo. (5.8.19) 
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13-—4D 
a / d°z./gRéo. (5.8.20) 
12a 


Notice that for D = 26, the conformal anomaly disappears. Thus, we have 
shown 


D— 26. (Cee, / Do=1. (5.8.21) 


Our final result for the functional is thus 


=13 

gre det'(—V? 

Z = f on ety” Perl a) | igs 
det!/ (Ww, w’) g { @z/8 


where all terms are evaluated with the metric g,, where the o term has been 
completely factored out. Thus, our goal of attaining (5.7.16) has now been 
accomplished, at least in 26 dimensions. 

Now that we have developed this powerful formalism, it is convenient to 
reduce this case to the problem of the single-loop amplitude and rederive our 
earlier result (5.3.12) in terms of our new perspective on Riemann surfaces 
(23k: 

For a single-loop graph, the region of interest is a doughnut. Our flat space 
metric 2, corresponds to a lattice in C: 


w£+ aL, (5.8.23) 


where Z corresponds to an integer. Thus, the region of interest is completely 
determined by the ratio 


15 = @/W}. (5.8.24) 


And the fact that we normalize the domain to have unit area. Then, the mapping 
class group is simply SL(2. Z), and the fundamental domain, as before, can be 
taken to be (t = tT; + it) 


Iz] > 1, 
— 7 (5.8.25) 
= 5 | aie 
The measure 
(det’ '/? PtP) (ma det’ ai) (5.8.26) 
fri@s./e 
can be further reduced. For example, the factor (5.8.26) equals 
set A) "Gaga": (5.8.27) 
where 
oo ; 4 
det’ A =e" ery [d= e7 je (5.8.28) 
n=1 
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The invariant measure can be written, as before, as 
ied t/t. (5.8.29) 
Combining everything together, we arrive at 


20 . 
[ [a = a 
in 


—48 


é (5.8.30) 


d T d 15) 
Z = 5 Qt) "Ee" 
fund region T 


as in (5.5.4), 


5.9 Superstrings 


Fortunately, the generalization of multiloop amplitudes to superstrings is 
straightforward, except for the complications due to defining spinors on 
Riemann surfaces of genus g. 

Let us consider the torus in two dimensions 0 , 02, constructed on a parallel- 
ogram by identifying opposite sides. A string X defined on the parallelogram 
must satisfy periodicity properties: 


X (01, 02) = X(o, + 20, 02) = X(0), 02 + 277). (5.9.1) 


However, the addition of spinors to this surface increases the number of possi- 
bilities. A spinor can be either periodic (+) satisfying R boundary conditions 
or antiperiodic (—) satisfying NS boundary conditions in either o; or 02. (So 
far, we have only considered different boundary conditions in the o direction 
in (3.2.16), not in the t.) Thus, the total number of different combinations 
for boundary conditions is four. The possible choices we can take to define a 
spinor on a torus correspond to (+, +). We say that these four possible choices 
for a torus define its spin structure. 

Furthermore, we know that modular transformations will mix up the bound- 
ary conditions, and hence the spin structures, so all four of them can contribute 
to the final amplitude. For example, the transformation 


(01, 02) > (01 + 02, Or) (5.9.2) 
changes (—, —) into (+, —). Likewise, the transformation 
(0), 62) — (02, —0) (3.9.3) 


interchanges (+, —) and (—, +). Thus modular invariance, which interchanges 
the boundary conditions, forces us to have (+, —), (—, +), and (—, —) in the 
amplitude. (++) is invariant by itself. 

To calculate the contribution of each spin structure to the one-loop amplitude, 
let us calculate the trace of the Hamiltonian over all four possibilities. The 


Hamiltonians for the NS and the R sector are 
2 ie, @) 


Ays = )_ rviwi- &, (5.9.4) 


f— 2 
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oo 
Hp=) aw wit i (5.9.5) 
| 


(3 comes from zeta function regularization. See Chapter 10.) These Hamil- 
tonians are either NS or R depending on their periodicity properties in the o 
direction. However, when we take the trace over these Hamiltonians, we are 
inserting a complete set of intermediate states in the t direction. The path 
integral in the t only selects out the antiperiodic boundary conditions, which 
is incomplete. We must modify the sum to include all possible spin struc- 
tures, which is made possible by the insertion of a factor of (—1 )*, where F 
is the fermion number. The final amplitude is the sum of amplitudes, A(+, +) 
defined for all four traces, multiplied by some coefficient C: 


AD] CG 4G), (5.9.6) 
a5 


Explicitly, each trace can be written as follows: 


A(-, —) = Tre?" = |@3(0 | t)/n(z)), 

A(+, —) = Tre?""#* = |0,(0 | r)/n(t)], 

A(—, +) = Tr(e7"*85(—1)") = |0,(0 | r)/n(z)T’, (5.9.7) 
A(+, +) = Tr(e77'"®(—-1)") = 0, 


where 7 is the Dedekind eta function: 


n(t) =e etit/l2 [ [a _ ey, (5.9.8) 


| 


Notice that the last trace vanishes by itself. (The first three theta functions 
© 3.4 are all even under z -> —z, while ©, is odd. Thus, only the even spin 
structures survive in the trace.) 

The significance of this is as follows. For the NS sector, for example. we in- 
clude both the (—, +) and (—, —) sectors, so we have to add their contributions 
to the trace: 


Tr + (— 1)" je (5.9.9) 


But this is precisely the GSO projection [26]. which projects out states even 
under (—1)/! We therefore have a new, physical interpretation of the GSO 
projection operator, which was introduced in (3.6.12) to eliminate the non- 
supersymmetric sectors of the NS-R theory. It’s surprising that modular 
invariance, supersymmetry, and the GSO projection are so intimately linked. 

When we calculate the vacuum amplitude, which has no external legs (and 
which appears in the calculation of the cosmological constant), we must add 
all four contributions to the spin structure. But then we have the remarkable 
result of Jacobi: 


©3(0 | r) + O40 | rt) — O40 | t) =. (5.9.10) 
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This states that the vacuum energy of the superstring, i.e., the one-loop 
correction to the cosmological constant, vanishes exactly! 

To sum up, we have proved that modular invariance and the GSO projection 
operation are essentially the same for the closed superstring. Modular invari- 
ance, which mixes up the boundary conditions, demands that we add all four 
spin structures to the final amplitude, which in turn corresponds to adding pre- 
cisely the GSO insertions of the operator (—1)* into the trace. Originally, the 
GSO projection was imposed on the NS—R superstring in order to have super- 
symmetry. We now know that the GSO projection enforces modular invariance 
as well [27]. 

As a bonus, we find that the one-loop contribution to the cosmological 
term is precisely equal to zero. (However, this does not mean that superstrings 
solve the vanishing cosmological constant problem. After supersymmetry is 
broken, we no longer expect that the vacuum contribution is zero, and hence 
superstring theory still does not explain why the cosmological constant is zero 
after supersymmetry breaking.) 

These statements reinforce our conviction that the internal consistency of 
the superstring theory is quite remarkable. 

Now let us generalize our comments to construct multiloop amplitudes with 
external lines. 

Fortunately, the apparatus that was constructed for Riemann surfaces carries 
over rather simply to the superstring case if we use the NS—R formulation of 
the model. 

Generalizing (5.7.2), the correlation functions can be computed from [28]: 


Z= | Deun | px" f ou" f Dyse™. 
ae es metrics embeddings 


topologies spin 
structures 
(5.911) 
The new addition here is the integration over the two anticommuting vector 
fields w and x and also the sum over all spin structures. 
We showed before in (3.4.5) that the NS—R action is invariant under 


8X%q = Dub. (5.9.12) 


Let us now, in analogy with the bosonic string, construct two operators P)/2 
and Pin such that 


(P128)q = 2Dyé — pap? Dye. (5.9.13) 
In addition to the integration over moduli, we now have to integrate over 
supermoduli, which are defined as the space of traceless Xq that cannot be 


’ gauged away by a local supersymmetry transformation. Thus, they satisfy 


Pe x? = 0. (5.9.14) 
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For a surface of genus N, the dimension of the supermoduli space is equal to 
the dimension of ker aye. 


0  ifN =0, 
Z if N = 1 (periodic—periodic), 
i ne 59.15 
aac: hay if N = | (otherwise), Bie 


4N —4 ines 2: 


(where periodic—periodic stands for boundary conditions on the NS-R field). 
We wish to to construct the Jacobian for 


DgwDxa- (5.9.16) 
The Jacobian for the fermion field is calculated as before: 
4N-—4 
Dyq = (det Pin Pipy Pe De Di | | daz, (5.9.17) 


i) 


where a; are the supermoduli, the counterpart of the Teichmiller parameters, 
and where the integration over the fermionic A and ¢ represents the inte- 
gration over the redundancy introduced by supersymmetry and super-Weyl 
transformations. We define 

l 


= PzV/B( 80.0340 — Lidy" Ded 
481 


+ p2(e — 1) +2732 prac + LR.o), (5.9.18) 


where R is the curvature tensor. Putting everything together. we arrive at the 
following: 


DgahDXa = (det! P| P,)(det-"? Pt, P12) 
4N-4 6N—6 
xe Do Dv,DADE | | Da; [|] Dt;. (5.9.19) 


1 i 


By dividing out by Do, Dv,, DA, and DZ, we extract out the infinite re- 
dundancy introduced by the symmetry of the superstring action. In practice, 
however, the parametrization of the moduli, especially beyond the third loop, 
is quite difficult. The nature of the supermoduli, unfortunately, is even less 
understood. These are some of the main stumbling blocks to a definitive un- 
derstanding of the multiamplitude, i.c., the choice of coordinates on a genus g 
Riemann surface consistent with modular and supermodular invariance. 


5.10 Determinants and Singularities 


The advantage of the Riemann surface method is that, at least formally, we can 
obtain results for the general singularity structure of the N-loop amplitude. 
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It turns out that the determinant that contains the singularity of the N-loop 
amplitude is the Selberg zeta function. 

We saw earlier that projective transformations naturally fall into conjugacy 
classes. Two projective transformations are said to be within the same con- 
jugacy class if they have the same multiplier. Thus, any two members of the 
same conjugacy class can, by a projective transformation, be brought to the 
form 


z— e'z, (5.10.1) 


The value / is sometimes called the “length” of a closed geodesic. Let z and 
z’ represent two points in the complex plane. Then we define the “distance” 
between these two points as 

eral 
2ImzImz’ 
The advantage of this definition of the “distance” between two points is that it 
is the same for all elements of the same conjugacy class. 

We saw, from the example of the single loop, that the multiplication by 
the multiplier w moved across the b-cycle. Topologically, this simply means 
executing a closed geodesic on the Riemann surface. Thus, / is the length 
of simple closed geodesics on the surface. We say that a transformation is 
primitive if it is not a power (greater than or equal to 2) of any other element 
of the set of projective transformations. Thus, we are interested in the set of 
primitive geodesics. We define the Selberg zeta function as [23] 


Z,(s) = al [ [i — ve, (5.10.3) 


| primitive p=0 


A (5.10.2) 


where we take products over the lengths of closed primitive geodescis y on 
the surface, and v(y) = 1 for bosons and v(y) = +1 for fermions, depending 
on the spin structure. 

Remarkably, we can write the various determinants found earlier in the 
multiloop calculation in terms of the Selberg zeta function [28]: 


det’ AZ = Z’(1)e70x™), 
det Af = Z'(2)e 1x, (5.10.4) 
The fermion determinants can also be expressed as follows: 
(det PY. Piya)? = eee ,(), 
=(1/2)X(M) 


é€ 
(det’ y* Duy? Dg), = Gp 2 (5), (5.10.5) 


where y is the Euler number (p is the number of zero modes of y* D,). The 
» numerical constant c is given by 


2en= >.  (2|n| — 2m — 1)log(2|n| — m) 


O0<m<{n|—1/2 
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—(\n| +4) +(n— [n]P +n + $)log 2x + 2¢(-1). (5.10.6) 


Thus, by analyzing the structure of one function, we obtain the singularity 
structure of the N-loop superstring! 

Mathematically, it is known that the Seiberg zeta function is well behaved 
unless the Riemann surface degenerates topologically. For example, we find 
a divergence as the length of the primitive geodesic approaches zero. This 
corresponds to the “neck” of one of the handles of the sphere stretching to 
infinity. By carefully analyzing the Selberg zeta function as one of the lengths 
of the primitive geodesics goes to zero, we find 


' 13 
det'/? PtP (oe a) ~ [eh lt (5.10.7) 
NOU ees 
which reveals the pole corresponding to the tachyon vanishing into the vacuum. 
Being able to isolate the singularities of the multiloop amplitude in terms ofa 
known mathematical function, the Selberg zeta function, is a great step toward 
solving the main problem facing string perturbation theory. 1.e.. rigorously 
proving finiteness to all orders. Formally, it can be shown that the divergences of 
the multiloop amplitude may occur when the integration over moduli changes 
the topology of the Riemann surface, e.g., when two holes separate and create 
a long goose-like neck. However, there are many delicate points related to the 
question of the cancellation of these divergences that have not been totally 
solved. Although preliminary results are encouraging. the rigorous proof of 
the cancellation of divergences is still an outstanding problem. 


5.11 Moduli Space and Grassmannians 


Although enormous progress has been made in elucidating the mathematical 
structure of multiloop amplitudes, in some sense the tangible results have been 
disappointing. Back in 1970, it was already known that the divergences of the 
multiloop amplitude, by explicit calculation, corresponded to the deformation 
of the topology of a Riemann surface [4, 5]. So tar, the elaborate mathematical 
techniques we have introduced still havent answered the key questions: Can 
we rigorously show the theory is finite to all orders? If so, how do we sum 
the perturbation series? How do we extract nonperturbative information trom 
the theory? Knowing that the divergences of the theory can be expressed in 
terms of Selberg zeta functions [23, 29 32], although an important result, still 
hasn’t solved these mysteries. In summary, the progress has been in the area 
of mathematics, rather than physics. 

At this point, we can take at least two diverging attitudes. We can give up 
on the perturbation series and proceed directly to the field theory of strings, in 
which nonperturbative information might be extracted. This is the traditional 
approach taken in ordinary point particle theories and is the subject of the next 
series of chapters. Or second, we might imagine some kind of symmetry, such 
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as modular invariance, by which we might be able to manipulate the entire 
sum over all Riemann surfaces of arbitrary genus. 

In ordinary point particle Feynman diagrams, the second strategy is proba- 
bly impossible. The symmetries are too small, and the Feynman diagrams are 
graphs, not manifolds. However, the Feynman series for string theory are sums 
over manifolds, Riemann surfaces, where modular invariance play a crucial 
role. Thus, it is conceivable that the entire perturbation series might be ma- 
nipulated mathematically. This is the approach of Friedan and Shenker [33], 
who propose exploring the properties of the “universal moduli space” of all 
Riemann surfaces, including infinite genus. 

Until recently, this program was too ambitious and complex to extract mean- 
ingful results. However, two developments have given some impetus to this 
approach: 


(1) First, Belavin and Knizhnik [34] have shown that the measure for the 
multiloop bosonic amplitude is simply the absolute value of a certain 
holomorphic 3g — 3 form. This is called “holomorphic factorization.” 
In principle, it may make possible writing the multiloop measure by in- 
spection! In practice, however, there are certain problems in specifying 
the parametrization of the period matrix beyond three loops. This is called 
the “Schottky problem.” Holomorphic factorization has made the two- and 
three-loop measure almost trivial, but the Schottky problem prevents the 
multiloop measure from being easily written. 

In 1984 mathematicians finally solved the Schottky problem. Thus, a key 
obstruction to utilizing holomorphic factorization seems to be removed. 
Solving the Schottky problem, moreover, provides us with an even more 
powerful tool. It allows us to describe an infinite-dimensional space, called 
the ‘““Grassmannian” (Gr), in which all Riemann surfaces of genus g are 
treated as single points [35-37]. Thus, by studying the properties of the 
points in the Grassmannian, it may be possible to manipulate the set of all 
perturbation diagrams at once. 


(2 


— 


(Although the Grassmannian finally provides us with the conceptual frame- 
work in which to treat the entire perturbation series as a whole, we still do 
not know how to sum the entire series. For example, phase space reduces the 
set of all possible positions and momenta of a particle to a series of points. 
Thus, phase space, in principle, contains all possible motions of all possible 
particles in the Universe. However, this says everything and it says nothing. 
We still have to impose equations of motion and boundary conditions to ex- 
tract any meaningful information from phase space. The same applies to the 
Grassmannian. ) 

This ambitious program requires fully exploiting the modular transforma- 
tions of Riemann surfaces of genus g. Of particular importance is the mapping 
class group MCG, which reduces to the modular group SL(2, Z) for the torus. 
What we want is a way of studying the mapping class group for arbitrary 
genus. The key will be the construction of theta functions defined on Riemann 
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surfaces of arbitrary genus. This is in contrast to the Schottky method that we 
studied in Section 5.6, where modular invariance was not so obvious. 

In Fig. 5.12, we have written the a- and b-cycles for an arbitrary closed 
Riemann surface, which we call the “canonical homology basis.” Let the an- 
tisymmetric symbol (a, b) represent whether or not two cycles intersect. It is 
equal to 0 if they do not intersect and equal to +1 if they do. For example, 
consider the torus, where we have only the a- and b-cycles. Notice that (a, b) 
is equal to 1, because the a- and b-cycles intersect, but (a, a) is equal to 0. 
Then the four possible combinations for the torus create a matrix: 


01 (a, a) = (b, b) = 0, 
& ‘) Zs Fesshecseconein (S110) 


The elements of the mapping class group do not change this intersection matrix. 
However, we know, by definition, that the group that leaves this matrix invariant 
is Sp(2, Z). (See the Appendix.) To describe the elements of Sp(2. Z), consider 
a Dehn twist D, created by slicing the surface along the a-cycle, twisting the 
cut by 277, and then resplicing the surface back again. Under this Dehn twist, 
the b-cycle converts into the sum of the a-cycle and the b-cycle (see Fig. 5.13). 
Thus, 


Di(a)'= a; D,(b) =a + b. Goll2t 


Now represent this in terms of matrix language. Let the Dehn twist operate on 
the column vector [a, b}]. Then the Dehn twist can be represented symbolically 


as 
AG ‘a, pal al 541i 
SE ee ae s=(, ig) Cae 


Similarly, we can describe the mapping class group for the two-loop surface 
by taking Dehn twists along a), a2, 6), b> as well as the cycle a, 'ax, which is 
a circular line that encircles the two holes. Using the same reasoning, we can 
show that the Dehn twists, operating on the column vector [a,. );. a». b>] can 
be represented as [38] 
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. Now let us treat the closed Rieman surface of arbitrary genus. The 
intersection matrix can be represented as 


(a;, aj) = (b;, bj) = 9, 
(ai, b;) = —(b;, aj) = Oi: 


This is nothing but a block-diagonal matrix, with equation (5.11.1) in each 
block. The group that preserves this matrix is Sp(2g, Z). We might suspect, 
therefore, that the mapping class group is just Sp(2g, Z). This is not quite right. 
There are also Dehn twists D, around cycles that are homologically trivial and 
do not transform the a- or b-cycles. Thus, they are represented as the unit 
matrix in the basis that we have been using. Nevertheless, these Dehn twists 
are legitimate global diffeomorphisms that must be included in the mapping 
class group. This subgroup is called the Torelli group T, and hence we finally 
have the desired result: 


(5.11.5) 


MCG 
—— = Sp(2z, Z). (5.11.6) 


(Fortunately, the effect of the Torelli group on spin structures is trivial, so we 
will delete further discussion of it.) 

Next, we wish to describe the period matrix for the Riemann surface and 
how it transforms under Sp(2g, Z). In addition to the 2g cycles we can write on 
the Riemann surface, we can also write 2g independent harmonic one-forms 
@; and @; on the surface (see Appendix). Because we have equal numbers of 
cycles and one-forms (due to the Hodge-de Rahm theorem), we can always 
normalize the integration over a-cycles such that 


; / w; = 8j. Git 


In general, the b-cycle integration will produce a g x g square matrix: 
[ oj = 9%, (5.11.8) 
bj 


called the period matrix, which generalizes the variable t introduced for the 
single-loop amplitude in (5.5.2) and equals the period matrix introduced for the 
multiloop amplitude in (5.6.31). It can be shown rather easily that the period 
matrix is symmetric and has a positive definite imaginary part. In general, 
the 5a(g + 1) elements of a symmetric g x g matrix (with positive definite 
imaginary part) generate a space called the Siegel upper half-plane. 

The advantage of introducing the period matrix is that no two inequivalent 
Riemann surfaces can have the same {2;;, which is called Torelli’s theorem. 
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Thus, up to Sp(2g, Z) transformations, the period matrix gives us a convenient 
way in which to characterize different Riemann surfaces. Under an Sp(2g, Z) 
transformation, the period matrix transforms as 


Q = (AQ + BY(CQ+ Dy", (5.11.9) 


where the A, B, C, D are each symplectic g x g matrices. Thus, two different 
period matrices may describe the same Riemann surface if they are related by 
an Sp(2g, Z) transformation. 

Now that we have a mathematical description of the mapping class group 
in terms of Sp(2g, Z) defined on Dehn twists, the next task is to write down 
functions defined over a Riemann surface with g handles. We saw earlier that 
the single-loop amplitude is described in terms of the quasi-doubly periodic 
theta function ©. Now, we wish to write down generalized theta functions that 
have certain periodicity properties defined for the surface with g handles. 

If we take the single-loop theta function and then force it to have these peri- 
odicity properties over the Riemann surface with g handles, we naturally add 
an additional set of infinite terms to the summation. Eventually, this summation 
approaches [39, 40] 


@(z | 2) = ~~ ein Qnt2rinz (5.11.10) 
neZs 
where the sum over the vector n is taken over a g-dimensional lattice. The 
theta function is defined not over the complex variable z, but over the vector z 
defined by 


za || Ww, fee) 
Po 

where po is an arbitrary point on the surface. Since w has g components .w,, 
then z is also a vector with g components 2;. 

It also possible to construct theta functions with spin structures. For the 
torus, we saw that a spinor parallel transported around a parallelogram picked 
up a phase of +1 (—1), depending on whether the spinor was periodic (anti- 
periodic). Likewise, the genus g Riemann surface can be represented as a 
polygon with 4g sides (Fig. 5.14). We now have to define 2-* phases in order to 
describe the different ways in which we can parallel transport a spinor around 
this polygon. The sect of characters [w] = (a. b) defines the spin structure of 
the manifold. 

We can now define the generalized theta function with spin structure: 


Ofa](z |Q)= > eit (n+a)-2.(n+a)+27i(n+a)(z+b) 
neZs 


= ete aria Eee SO ab | Q2). (5.1 1.12) 


Under a shift in the lattice, we find that the generalized theta function is periodic 
up to a phase: 


Ole} (e tb 82 yen | 2) ee ee lz iia) 
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FIGURE 5.14. Canonical homology cycles of the Riemann surface of genus 2. This 
figure is formed by drawing the a and b homology cycles of a sphere with two 
handles or holes, cutting along the lines that form the cycles, and then unraveling 
the surface. The advantage of this basis is that the homology cycles for a surface of 
arbitrary genus can be represented as a polygon. 


Under the change z — —z the theta function transforms as 
O[a](—z | 2) = (—1)*Ofa](z | 2). (5.11.14) 


We can thus categorize spin structures, as we did for the single-loop function 
in (5.9.7), according to whether or not they are even or odd under z > —z. 
We find that there are 28—'(28 — 1) odd and 28—'(28 + 1) even spin structures 
on a Riemann surface of genus g. 

It is now astraightforward task to calculate how these generalized theta func- 
tions transform under a mapping class group transformation (5.11.9). Under 
Sp(2g, Z), we find 


Ofa](z’ | 2’) = ee !7? det(CQ + D)'”Qla](z | Q), (S115) 
where 
: zZ=(CQ+D)'-z (5.11.16) 


t ‘ T 
hee eae |. (5.11.17) 
b’ =B A b DAB ),, 


and the phase factor is given by 
¢ = aD! Ba+bC’ Ab — 2aB' Cb — (aD’ — bC’)(AB")a, 


where T represents the transpose and d represents taking the diagonal element. 
The symbol ¢ represents the eighth root of unity (with a certain restriction that 
does not worry us here). 

_ Now that we have defined theta functions with the proper periodicity on 
the Riemann surface, our next task is to actually calculate the measure of 
the multiloop amplitude. Here, we are greatly aided by a remarkable result 


and 
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of Belavin and Knizhnik [34], which simply states that the measure of the 
multiloop amplitude is the square of a certain holomorphic function (up to 
zero modes): 


AG 
holomorphic factorization: Z= / Las (5.11.18) 


r (det Im Q)!3’ 
where F is the fundamental region of the surface, and 7 is a3g — 3 form: 
32-3 
i dy; F(y), GI) 
i=l 
where y; is a parametrization of the Teichmiller parameters, and F(y) is holo- 
morphic, has no zeros, and has second-order poles at the boundary where 
the surface degenerates. It is easy to check, for example, that the single-loop 
function (5.5.4) possesses this property (up to zero mode terms). 

The statement of holomorphic factorization, on one level, is intuitively obvi- 
ous. It simply says that the closed loop is a product of the left- and right-moving 
modes of the string, except for the zero modes. The proof of this powerful re- 
sult, however, is rather involved, but can be summarized as follows. Using the 
techniques derived from the previous sections, we know how to formally write 
the measure of the multiloop amplitude in terms of Teichmuller parameters 
and complicated determinants. Then, it can be shown that 


09 log W = (D — 26)---, (5.11.20) 


where W is the measure term up to the factor raised to the thirteenth power. 
Notice that the right-hand side is the conformal anomaly term, which vanishes 
in 26 dimensions. Thus, in 26 dimensions we have 00 log W = 0, in which 
case it can be written as the absolute value of a holomorphic function | F|*. 
When we are in any other dimension, there is an obstruction to holomorphic 
factorization. Then we say that there is an analytic anomaly. 

What is remarkable is that the conditions on F are probably stringent enough 
to uniquely fix the measure for all multiloops [41-61]. This is an enormously 
powerful result, which may eventually allow us to write the multiloop measure 
by inspection. 

For example, consider the two-loop function. Let us choose the Teichmiuiller 
parameters as the period matrix itself, since they both have three independent 
complex variables. Then the following combination is modular invariant: 


dQ 
(det Im Q)3 


We wish to find an expression for | F'|’ (det Im Q)~'°. Using arguments from 
the theory of Riemann surfaces, we find that the unique answer is 


Glo 


—2 
two-loop: F = [Jomo in| (Sul 23) 
a,b 
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where the product is taken over the 10 even characters such that 4a- b = 
0 mod 2. 

Likewise, using similar arguments, about holomorphic functions, we can 
represent the F function for the three-loop amplitude as 


=P 
three-loop: F = (1 eraio| 9} (5.11.23) 
a.b- 


where once again the Teichmiiller variables dy; are taken to be the elements 
of the period matrix dQ;;. 

This simple procedure, however, stops abruptly at the four-loop level. In 
general, there are }¢(g + 1) elements in a symmetric matrix defined in the 
Siegel upper half-plane, while the number of Teichmiiller parameters is 3g — 3. 
Thus, they have the same number of elements only for g = 2, 3, which makes 
a characterization of the period matrix of higher loops exceedingly difficult. 
This is the origin of the Schottky problem, the fact that the set of elements 
in the Siegel upper half-plane agrees with the elements in the period matrix 
only for g = 2,3. The question is: What conditions do we have to place on 
our functions defined in the Siegel upper half-plane such that they become 
functions of the period matrix? 

Fortunately, mathematicians have recently solved the Schottky problem, 
making it possible to simply characterize period matrices for arbitrary g and 
hence describe holomorphic functions defined over Riemann manifolds of 
arbitrary genus. In fact, this allows us to envision an infinite-dimensional space, 
the Grassmannian, where Riemann surfaces of arbitrary genus are represented 
as points. 

What we want is a generalization of an operator formalism for a conformal 
field theory defined on a Riemann manifold of arbitrary genus. We want to 
construct operators acting on a generating function [35—37] that will allow us 
to derive all correlation functions defined on surfaces of genus g. Let us begin 
by defining fermion operators 


Ui ee (5.11.24) 
with the usual anticommutation relations 
{Wns} = On 6.1125) 
and the bilocal current 
AZ Ay ==” (OD), (oalT25) 


The current operator is the diagonal element 


1 Cae | el aaa (5.11.27) 


neZ 
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Now define the generating function 


t(x) = (0| e#g |0) , (5.11.28) 
where 
HG) = Yo (5.11.29) 
n=0 


g is an element of the Clifford group given by 


c= ep ¢ a: f dau f (gc WIS wf. (S1130) 


where the integral is taken around infinity. [t is important to know that g is 
a function of the period matrix Q as well as the spin structure defined over 
the surface, which is parametrized by the characters a and b. Thus, all the 
information concerning the nature of the Riemann surface is encoded within 
the function g. 

In ordinary point particle path integrals, we know that the action of the 
operator 5/5J on the generator of the Green's function creates the insertion 
of a field into the functional integral. The analog of this is given by the vertex 
operator 


v(z) = ern=0%n2" glogzdo+ 2 na eratid: (5.11.31) 


where x,, 0, have the same commutation relations as the usual harmonic 
oscillators. By construction, we have the identity 


(0] w(z)y*(w)g |9) 

(0| g |0) 
Thus, the action of the vertex operator is to create the correlation function 
for the conformal field theory, similar to the action of 5/SJ on the generating 


function in ordinary field theory. 
Now let us impose the conditions, called the Hirota equations, 


= t!(x)v(z)u*(w)t(x) | <0 - (5.11.32) 


f dzvu(z)t(x)vu*(z)t(y) = 0. (5.11.33) 


The Hirota equations can be shown to be intimately linked to the equations of 
the KP (Kadometsev—Petviasvili) hierarchy. 
We now have all the tools to write an explicit expression for T(x) [35-37]: 


ie.2) 
T(x) = @Xnwz0% Qnntn Of] (- S- Anka | 2] (5.11.34) 


nl 


where the A,, appear in the power expansion of the period matrix: 


Q(z) = Anta dz (5.135) 
n=0 
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and 


E(t, y) 
(E=y) 


Onm = $[(m = 1)! n(n — 1)tT-1a""9" log 


(5.11.36) 


t=y=0 


where E(t, y) is called the “prime form” and is a straightforward generalization 
of the function z — w found in the conformal field theory of the sphere. In fact, 
for z near w, it behaves like z — w even ona surface of genus g. Explicitly, it 
equals 


ore w) = Ofa] ( / w | a) (he(z)ha(w)) 7, Galle) 
where 


h,(z) = wy 9; O[a](0 | 2)e;(z). (5.11.38) 


Now we come to the main result of this discussion. We find that the tau function 
satisfies the Hirota equations of the KP hierarchy if and only if Q is, in fact, 
the period matrix of a genus g Riemann surface. This is the constraint that we 
were looking for. 82 which has dimension ! g(g + 1), can now be restricted to 
dimension 3g — 3 and is equal to the period matrix of a Riemann surface of 
genus g. This, in principle, solves the Schottky problem (although the result 
is highly nonlinear). 

As an added bonus, because t(z) can be viewed as the generating function 
for conformal field theory defined on an Riemann surface of genus g, we can 
calculate arbitrary correlation functions. The matrix element of two fermions 
on this Riemann surface is given by 

(Ol vezyw*(w)g 10) _ _ O11) _ ope ay (5.11.39) 
(O| g |0) O[a](0 | QVE(z, w) 
Mathematicians know this as the Szego kernel, which is the unique mero- 
morphic half-differential in z and w with a single pole with residue | for 
z = w. Physicists know this simply as the two-point function. The N-point 
generalization is equal to 


(0] w(zi)W*(w1) = ++ Wn) W"(Wn)g 10) 


(0| g |0) 
=] 2G. z) [] Ex. wO"[a}(0 | 2) 
i<j kel 


x [Te w,)Q[a] (> ope pa | 2| (5.11.40) 
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There is a way in which to check the consistency of these equations. We know 
that there are two ways in which to calculate the N-point function, in terms of 
either fermionic oscillators or, through bosonization, bosonic oscillators with 
the form e’®. For example, by using the Wick decomposition on the N-point 
function, we can write it terms of the two-point function as 


N N 
OL] [vd] [ v*Qey)g 10) = det; Pa(z:, w,). (5.11.41) 


i=! j=l 


Equation (5.11.40) can be obtained using the bosonic representation, while 
(5.11.41) can be obtained using the fermionic representation. Fortunately, the 
equivalence of these two different expressions was proved by Fay [39]; this is 
called the “trisecant” addition theorem for theta functions. The mathematical 
equivalence of these expressions is another check on our bosonization scheme. 

In summary, what we have developed is an operator formalism for conformal 
field theory defined on an arbitrary Riemann surface. The ket vector g |0) 
represents a specific Riemann surface, and applying various field operators 
on it corresponds to taking matrix elements over that Riemann surface. It is 
important to note that each element T(x), if it satisfies the Hirota conditions for 
the KP hierarchy, now correctly characterizes a Riemann surface of arbitrary 
genus g. Thus, each t(x) defines a point in our Grassmannian, which is the 
desired result. 

Notice that we are still a great distance away from our final goal, which ts 
to sum the perturbation series and extract nonperturbative information from 
it. However, we have made a significant step in this direction because we can 
now characterize every Riemann surface of arbitrary genus. including its spin 
structure, as a point in a Grassmannian space. With this operator formalism. 
we can generate points on the Grassmannian at will. It remains to be seen how 
useful the Grassmannian will become in the future. 

We should also mention some difficulties with this formalism. Although the 
Schottky problem is now formally solved, in practice, the Hirota equations are 
quite nonlinear, so it remains to be seen precisely how practical this solution 
will be in the explicit construction of multiloop amplitudes in terms of theta 
functions. Furthermore, we must mention that there exists a certain amount of 
confusion in the literature concerning how to detine the supermoduli space for 
the superstring amplitudes. Although the super-Riemann-Roch theorem tells 
us how many supermoduli there are, there still remains the difficult problem 
of constructing them explicitly in a well-defined way. For an introduction to 
supermoduli space and its problems, see [62]. 

In the next series of chapters, we will discuss an alternative to the Grass- 
mannian approach, which requires no information about the genus g at all. In 
this approach, in fact, the entire perturbation series on Riemann surfaces is 
only one possible approximation scheme. This is the ficld theory of strings. 

Before closing this chapter, we should mention that, of the various multiloop 
formalisms discussed so far, only the light cone formalism can be naturally 
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FIGURE 5.15. In the last diagram, we see the angles, lengths, and “times” that 
parametrize the light cone surface. These parameters make up the Teichmiiller space, 
and six of them are required to parametrize each internal loop. (These parameters 
automatically yield one cover of the fundamental modular region.) 


derived from a field theory of strings. We thus intuitively expect that the con- 
struction of light cone Feynman diagrams by assembling vertex functions in 
the S-matrix yields a specific “triangulation” of moduli space. This result, 
which was proven in [63, 64], was never previously considered by the math- 
ematicians. (Briefly, the proof is based on constructing certain integrals on a 
Riemann surface of genus g and calculating their periods as we move around 
each loop or handle. Then it can be shown that the periods are purely imaginary, 
which yields a modular invariant description of the surface.) 

For example, in Fig. 5.15, we see how a multiloop surface is parametrized 
with light cone parameters, with t; representing the various interaction times 
when the string splits or joins, and 6; representing “twisting” each string by 
one full revolution. To complete 6g — 6 parameters for a surface of genus g, 
we also need to integrate over the circumference of each cylinder. Thus, all 
6g — 6 moduli parameters have a natural interpretation in terms of the phys- 
ical parameters of a Feynman diagram. The light cone formalism has several 
advantages over the previous formalisms. First, there is no need to truncate the 
integration region, which is necessary in the Schottky and the constant cur- 
vature formalisms for modular invariance. Second, it easily generalizes to an 
arbitrary number of loops (which cannot be said for the theta function method.) 
Third, it is unitary and based on physical variables. And fourth, it is naturally 
derived from a field theory. 

Instead of discussing this formalism, we will discuss the light cone method 
’ in the context of the field theory of strings. We will devote the next series of 
chapters to this important topic. 
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In summary, we have shown that the unitarization scheme for the string can 
be implemented by using the unitarity equation, using the N-point amplitude 
as the Born term: 


] 
Bed ey (nines (5.12.1) 
The function integral is now modified to 


N 
A= / DX die? | ler", (5.12.2) 
S i=] 
where we sum over all conformally inequivalent surfaces S that are disks or 
spheres with N holes. In particular, the open string graphs come in three types: 
planar, nonplanar, and nonorientable (as in a Mobius strip). The closed string 
graphs come in only planar and nonorientable (as in the Klein bottle). 
Fortunately, mathematicians have already calculated the Neumann functions 
on these surfaces, which are particular examples of automorphic functions. 
For the first loop, the Neumann function is given in terms of 


; 
leer 


2 In w 


In w(x, w) = In(i — x) + ; Inx + 
+ 5 “[in(l — w"x) + In(1 — w"/x) — 2In(1 — w")]. (5.12.3) 
t=) 


Putting everything together, the first loop planar amplitude is equal to 


1 N-1 
Ania deo | al O(Vj41 — ¥;) dv; 
0 j=] 


de (a= i ae 
; (q°)~ [| pp, oe 
«| q° ( Inq ) iq) ee (5.124) 


We found that the string graphs never have ultraviolet divergences. The 
infinite sum over the intermediate Regge trajectories renders the graphs finite in 
the ultraviolet region. However, as in Feynman graph theory, the divergence of 
a graph is given by the local change in topology of the surface. The single-loop 
open string graph diverges when the interior hold vanishes. This corresponds, 
by conformal invariance, to extracting a closed loop with zero momentum. 
Thus, the divergences correspond to a tachyon or a dilaton scattering into the 
vacuum. 

The divergence structure of these graphs are as follows: 


(1) The Nambu—Goto open string diverges as g~* and g_'. The latter diver- 
gence, associated with a dilaton, probably can be absorbed into a slope 
renormalization to all orders. The tachyon divergence is more troublesome. 
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The nonplanar graphs, when analyzed in the complex s- and t-plane, actu- 
ally contain cuts in them, which reduces to poles in 26 dimensions. These 
poles correspond to closed strings. Thus, the open string theory, by itself, 
is incomplete. Closed strings emerge as “bound states” at the first-loop 
level: 

(2) The closed string amplitude diverges because of tadpole or self-energy 
insertions on external legs, which are on-shell. 

(3) The Type I superstring has only g~' poles, and hence we can have slope 
renormalization to absorb this infinity. The g~? divergence never occurs 
because the boson and fermion internal legs cancel against each other. 

(4) The Type II superstring is actually finite. There are no two- and three-point 
single-loop graphs, and hence tadpole or self-energy insertions on external 
legs are simply forbidden. 


These results generalize nicely to the N-loop amplitude using the theory 
of automorphic functions. There are several parametrizations we can use. The 
first is the Schottky group method, which has the advantage that its choice of 
variables is explicit and was derived in a manifestly factorizable fashion by 
sewing multi-Reggeon vertices together. However, modular invariance is not 
so obvious. Another is the theta function method, in which modular invariance 
is built in from the very start. It also easily generalizes to spin structures. 
The disadvantage of this method is that the choice of variables is far from 
clear. Also, factorization and hence unitarity are obscure. These amplitudes 
are simply postulated by hand to satisfy the boundary conditions, rather than 
being constructed by sewing vertices together. eva these methods are 
probably identical. 

Let us start with the complex plane with 2/N arbitrary holes cut out. Let us 
pair up the holes into N pairs, labeled a; and a;. These are called a-cycles. If 
we cut a line between any pair of a-cycles, we obtain b-cycles. Let us define 
projective operators P; that map one a-cycle a, into its partner a;. Then these 
projective operators can be parametrized by two invariant points z; and z2 and 
a multiplier X, such that 


(5.12.5) 


For the open string, the centers of the a-cycles lie on the real axis, as well 
as the invariant point of the projective transformations. The final open string 
N-loop amplitude is given by 


N 
An = [Ta du} [] [a - Xa)” 
a=1 


{P} (Pp) 


I] (z; - {P}z, ems ky I i, 1 


l=i<j=M p.A=I 
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aT PUS geo , (5.12.6) 
{P}x, {P}x; 


where 


M N 
di |i dzdVi | eeu a) Gulez) 


a=1 


where the Roman letters refer to the external lines and the Greek letters rep- 
resent the loops. Notice that the region of integration lies between the limit 
points of the set {P}. We must have these limit points forming a discrete region 
on the real axis. Thus, we must have Schottky groups. 

The N-loop amplitude can also be directly reformulated in the language of 
Riemann surfaces if we use the Polyakov action instead of the Nambu—Goto 
action. There are several new contributions to the integral: 


(1) A factor Arp coming from the Faddeev—Popov determinant, which can be 
rewritten as 


det!/? P* Py. (5258) 
(2) A factor from the integration over the X variable: 
[ox exp (- [ @evaerax"ax, 
In —(1/2)D 
2 
= (see det'(—V ) ‘ (5.12.9) 


where 


=] 
vee yon BB ae (5.12.10) 


(3) The Teichmiuller parameters ¢; must be added to the integral. To calculate 
this last factor, let us write 


b8ap = [VadUp + Vpdvq — (V-5°) gan] 


+ (V,5U°) gap + 2804p + dt;T', (SQ) 
where #; are the Teichmiller parameters and 
;  98ab 
ee Se — (trace). (5.12.12) 


We can rewrite this as 


58ah = P\(5V)an + 200 ga + St! 0; 8a, (5.12.13) 


5.12 Summary 241 


where 

P,(8v)ah = Va5Up + Ved0q — Sap VeSV°. (5.12.14) 
We change variables to 

=v. T)+ Pu, - (5.12.15) 

where 
i Sp 5.12.16 
Ppt’ (5.12.16) 
WT) = f devEeM eT oy (Gn12 17) 


Our final result for the measure is therefore 
Dg ap Q25iq 2 yey =D Va pig DO OR 
det(w?, T') 
det'?(y2, w") 


—(1/2)D 
Gear det’ (— v4) . (5.12.18) 


x Dt' det'?(P! P,) 


The great advantage of this approach to the multiloop amplitude is that 
we can use the power of Riemann surfaces to analyze the divergences of the 
amplitude. In particular, we can analyze the singularities of the Selberg zeta 
function: 


Z@=e |] [ [i we"). (5.12.19) 


{ primitive p=0 


Analysis of the Selberg functions confirms the fact that the measure has a 
singularity where the topology of the genus g surface degenerates. 

Fortunately, the construction of the measure on multiloop surfaces 1s greatly 
facilitated by the holomorphic factorization theorem, which states that the 
measure is equal to 


ms) Dee iene 5.12.20) 
Jr (det Im 2)3’ a 


where 7 is a holomorphic 3g — 3 form. The formula is intuitively obvious if 
we consider the equal contribution of left- and right-movers to the amplitude 
(except for zero modes). The only complication to the proof is an anomaly 
(called the analytic anomaly) that vanishes in 26 dimensions. Using this result, 
we can basically guess the answer for the integrand of the multiloop amplitude, 
' since the function with the correct periodicity and holomorphic singularity 
structure must be unique. 

Another method, based on theta functions, exploits the modular properties of 
these periodic functions from the very beginning. For surfaces of genus g, we 
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have 27% spin structures, corresponding to all possible periodic (antiperiodic) 
boundary conditions on the surface. Our task is therefore to construct Neumann 
functions on this surface with spin structure that have the correct periodicity 
properties and singularities. The answer is given in terms of two functions, the 
generalized theta functions and the prime form. The theta function is given by 


C[a](z | Q) = an eft (nta)-2-(n+a)t+2ni(nta)(z+b) | (5.12.21) 


neZs 


where @ represents the spin structure and the prime form (which is the 
holomorphic generalization of z — w found for the sphere) is given by 


E(z, w) = ©[a] ([ | a) (ha(z)hg(w))?, (S222) 


where 
ha = Y_ 3; O[a](0| Q)a,(z). (5.12.23) 


Given the theta function and the prime form, we can calculate all possible 
Green’s functions for bosonc and fermionic operators on Riemann surfaces. 
Thus, we can construct a new conformal field theory on Riemann surfaces, not 
just spheres. For example, the two-point function of two fermions is given by 
the Szeg6 kernel: 


Ofa](f, @ | 2) 
Of] QVE(z, w)’ 


One difficulty in solving the multiloop problem is the question of choosing the 
correct coordinates. One ideal choice would be to use the period matrix Q as 
the coordinate. The problem, however, is that there are 4 2(¢ + 1) dimensions 
to a square matrix like the period matrix, but only 3g — 3 complex Teichmiiller 
parameters. For two and three loops, they coincide, but for higher loops they 
do not. This is the Schottky problem, which only recently has been solved. 
The solution to this problem makes possible the use of Grassmannians to char- 
acterize the entire perturbation series. Each multiloop amplitude with spin 
structure represents a single point in the Grassmannian, so we may (at least in 
principle) have a way of manipulating the entire perturbation series over Rie- 
mann surfaces all at once. This may, in turn, eventually vield nonperturbative 
information about the superstring theory. 
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Part Il 


Second Quantization and 
the Search for Geometry 


CHAPTER 6 


Light Cone Field Theory 


6.1 Why String Field Theory? 


In Part I, we saw that the development of the first quantized theory often 
appeared disjoint and seemingly random, appealing to rules of thumb and 
folklore that often seemed quite arbitrary. The choice of vertex functions, the 
measure of integration, the counting of diagrams, etc., all were inserted by 
hand. In summary, the first quantized theory suffers from several important 
defects: 


(1) The interactions must be introduced ad hoc, without any rigorous, overall 
motivation. 

(2) Unitarity cannot easily be shown in the first quantized approach. There 
is no Hermitian Hamiltonian from which we can derive the interacting 
theory. 

(3) The theory is necessarily perturbative and on-shell, which makes difficult 
the calculation of nonperturbative effects. 


Most important, the first quantized string theory is unsuitable for a cal- 
culation of dynamical symmetry breaking. Unfortunately, to all orders in the 
coupling constant, the dimension of space-time seems to be stable. This means, 
of course, that there is little hope within the first quantized approach of calcu- 
lating dynamical breaking of 26 or 10 dimensions down to four dimensions. 
Thus, it is doubtful that rigorous phenomenology can be done within the first 
quantized framework. The first quantized theory, although it can yield virtually 
tens of thousands of classical solutions, cannot select out the true vacuum from 
among the classical vacua that the model can exhibit. 

Thus, in Part II we now turn to the field theory of strings [1-5], which has 
the promise of yielding a nonperturbative formalism in which the true vacuum 
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may be found. (We should point out, however, that second quantization has 
been developed only for string theory, not M-theory. In fact, it is not known 
how to formulate M-theory in a satisfactory first quantized formalism, let alone 
a second quantized one. Thus, our remarks about second quantization will be 
confined strictly to superstrings.) 

At first, the second quantized approach seems to be totally redundant to the 
usual first quantized approach. Perturbatively, we simply reproduce the same 
diagrams as the first quantized approach. However, there are several important 
advantages to the second quantized field theory: 


(1) Interactions are introduced through a new gauge group whose elements 
close on interacting strings. There is now a group-theoretical motivation 
for introducing the interactions of string theory. 

(2) The theory is manifestly unitary because the Hamiltonian is Hermitian. 
All weights of perturbative diagrams are fixed at the very beginning. 

(3) Most important, we have, in principle, a method for calculating dynamical 
effects in the theory. 


Historically, it was once thought that a field theory of strings was impossible 
because it would violate a host of fundamental and cherished principles of 
quantum mechanics. Specifically: 


(1) A field theory of strings would be a nonlocal theory riddled with fatal 
problems, such as the violation of causality. 

(2) A field theory of strings would necessarily be off-shell, yet the crucial 
properties of the Veneziano model, such as cyclic symmetry, only hold 
on-shell. Thus, a field theory of strings would not reproduce the dual 
model. 

(3) A field theory of strings would not be both Lorentz invariant and unitary 
at the same time. This is because a quantization program for strings does 
not exist that is both Lorentz invariant and unitary off-shell. For the first 
quantized theorem, this is not important because the theory is on-shell. But 
the field theory of strings is necessarily off-shell and hence must violate 
either Lorentz invariance or unitarity. 

(4) Most, important, a field theory of strings would be plagued with violations 
of unitarity because of severe problems with overcounting of perturbation 
diagrams. Field theories add the sum of s- and r-channel poles separately, 
which violates duality. 


Fortunately, a field theory of strings that answers each of these four 
objections is, indeed, possible. 

First, string field theory does not violate causality because the interactions, 
such as the breaking of a string, take place instantaneously. Furthermore, the 
information concerning the change in the topology of the string travels along 
the string at or less than the speed of light. In other words, string theory is 
multilocal. Thus, string field theory is the only known nonlocal field theory 
consistent with the principles of quantum mechanics. 
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Second, string field theory generates Green’s functions that necessarily vi- 
olate some of the important properties of the Veneziano model. But this is 
irrelevant, because these Green’s functions correctly reproduce the Veneziano 
model on-shell, and only on-shell matrix elements can be measured. 

Third, the BRST method explicitly breaks unitarity off-shell with Faddeev— 
Popov ghosts. This makes no difference, however, because the Faddeev—Popov 
ghosts cancel with the unitary ghosts on-shell. Similarly, the light cone method 
maintains unitarity but breaks Lorentz invariance off-shell. The point is that 
on-shell, the theory is both Lorentz invariant and unitary, so the theory is still 
well defined. 

Fourth, string field theory actually breaks duality off-shell, but this makes 
no difference on-shell. For example, in light cone string field theory we sum 
separately over s- and t-channel poles in the Feynman series: 


Ay Aj 
A 200, bli! 
“WD ae (6.4.1 


T 


The field theory of strings solves the problem of double counting by breaking 
up the amplitude into its separate r- and s-channel parts, consistent with a 
string interpretation (see Fig. 6.1). Thus, string field theory breaks manifest 
duality, but recovers it at the level of the S-matrix. Although string field theory 
explicitly solves the problem of unitarity, it does so at the price of breaking 
manifest duality, which is recovered only at the end when we sum over all 
Feynman diagrams. 

For closed strings, we also find that the individual Feynman diagrams break 
manifest modular invariance. Only the sum is modular invariant. Thus, the light 
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FIGURE 6.1. Surfaces generated by the light cone field theory. As in any field theory, 
we now sum over s- and t-channel graphs separately, breaking duality. Only the sum 
of these two graphs is dual. Thus, the light cone field theory of strings solves the 
‘problem of double counting: duality is explicitly broken for each diagram. Only the 
sum (the S-matrix) is dual. 
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cone field theory is manifestly unitary (because the Hamiltonian is manifestly 
Hermitian) but the price we pay is that we break manifest modular invariance: 


First quantization (modular invariance) —> second quantization (unitarity). 


We will begin a discussion of the second quantized theory string starting with 
the light cone formalism [1], because this retraces the historical development 
of the theory and because it is the most fully developed formalism. However, 
because the light cone gauge is a gauge-fixed formalism, in which all the local 
gauge degrees of freedom have been explicitly removed, there is no trace of 
the elegant group-theoretical formalism from which string field theory can be 
derived. Thus, for pedagogical reasons we begin our discussion with the light 
cone field theory. 

We begin by once again discussing the field theory of point particles. We will 
trace how Feynman derived the Schrodinger equation from the first quantized 
theory of point particles. 


6.2 Deriving Point Particle Field Theory 


In Chapter 1, we started with the path integral for a point particle moving from 
a point x; to point x;. Associated with each path connecting these two points 
was a phase factor e'’. The fundamental postulate of quantum mechanics is 
that the probability amplitude of a particle moving between these two points 
is the sum over all possible phases associated with each path. After a Wick 
rotation, we have 


ay =f Dxe-, (6.2.1) 
where 
s=| dt (4mv; — V(x)). (6.2.2) 


The evolution of a quantum-mechanical wave, by assumption, obeys Huygens’ 
principle 


w(x;, t;) = ih A{(x;, tj5 Xi, ti )W(x;, t;) dx;. (6.2.3) 


io @) 


Now we would like to calculate the variation of this wave functional after a 
small displacement ¢ in time. Earlier, we found that the propagator was equal 


to 
‘ m rc 2 Tie exe) a 
i) =a ee ex ee SS be ola 
1 {| 2x(t; — t) P th — t; Czy 
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For small time intervals ¢ = t; — t;, we find, therefore, 
foe) i = ah! 
w,t+e)= ‘| 7 ee exp | ME 
=O) 
where A is a normalization constant. Remember that the time interval ¢ is very 
small, but the separation between x and y need not be small. 


We want to keep only terms of order ¢. If we let y = x + 7, where n is not 
necessarily a small number, the integral becomes 


| wy, t)dy, (6.2.5) 


W(x,t+e)= i Aleit 22 wx + n, t)dn. (6.2.6) 


It is important to notice that there is no limit, in principle, to the size of 7. 
However, in the functional integral, keeping terms of order e will necessarily 
restrict us to terms second order in 7. Now let us power expand the left-hand 
side in terms of ¢ and the right-hand side in terms of 7: 


0 90 
won tert = [ An eimn’ /2€ 
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i) a? 
epee ee ae (6.27) 
Oka i Oe 
This integral can be explicitly performed. First, the integration constant can be 
determined to be 
omie\? 
A =| =) . (6.2.8) 
= 


Writing out the terms in the right-hand side, we notice that the only terms that 

survive the Gaussian integration are those for which n appears with an even 
power in the integrand. Finally, we are left with 
aw 1 ay 

i— = -— ; 6.2.9) 

: ot 2m dx? ( 

Thus, we have now derived the Schrédinger equation, starting only from the 

assumption that L = tmx}, and the basic assumptions of quantum mechanics. 

If we include the effects of a potential term and extend the expression to all 

three spatial directions, the derivation is basically unchanged, and we arrive at 


hae = See + V(x). (6.2.10) 
a 2m 

We can also dispense with external potentials entirely and introduce y?- or y*- 
type interactions directly into the action. This is the second quantized analog 
of summing over Y-shaped or X-shaped topologies in the first quantized point 
particle theory. 

We have followed Feynman’s original derivation of the Schrodinger equa- 
tion, based on calculating the time evolution of the wave. However, there is yet 
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another way in which we can make the transition from the first to the second 
quantized formalism in which the derivation is more direct. Let us now turn 
to the path integral action and show that we can make the transition to the 
second quantized formalism starting from the Green’s functions themselves, 
rather than appealing to the equations of motion. 

In Chapter 1, when we made the transition from the Hamiltonian to the 
Lagrangian formalism in the first quantized point particle theory, we inserted 
an infinite set of intermediate states, the eigenvectors of the x-coordinate: 


1 = fein) f Dri (ot (6.2.11) 


into the expression 
Aj; = (Are tle 7) (6.22) 


at each intermediate point between x; and x;. This allowed us to make the 
transition between the Hamiltonian and the Lagrangian approaches. 

Now we wish to replace this with the integration over a complete set of 
second quantized fields: 


|= Iw) | D2yev™ (wl. (6.2.13) 
Following (1.8.21), we define 
W(x) = (xl), 
w"(x) = (wix), 
Dy =| [dv@)ay*(a). (6.2.14) 


Thus, at each intermediate point between the initial and final states of a point 
particle, we are now going to introduce an infinite set of intermediate functional 
states |y) rather than x-cigenstates |). The easiest way to check the validity 
of (6.2.13) is to take the following matrix element and insert a complete set of 
intermediate string states: 


8(x — y) = (xly) 
= (x| 1 ly) 


= xb) f Dive |— fovier f dec} wi 
= | Pwrowmer|- f rowan (@2. 15) 


(where as usual we drop overall normalization factors in the path integral). We 
treat w(x) as an element of a column vector labeled by discrete elements vx. 
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Discretizing the expression, we find 


Bey ~ i [ avs aveviyy exp |- Sy ve | (6.2.16) 


This last relation is just (1.7.11). Thus, we have now shown that we can move 
interchangeably from first quantized basis elements |x) to second quantized 
basis elements |W), such that w(x) = (x|w). 

Now let us begin our derivation of the Green’s function for the Schrodinger 
equation entirely in terms of second quantized field functionals, without re- 
course to the equations of motion or Huygens’ principle. We will insert the 
following identity at every intermediate point along the path: 


ia oi / DPD? exp—Wi(a — th) — VeVi) Wel. (6.2.17) 


(We can prove this identity by functionally integrating the expression over yw, 
which then reduces to the completeness expression written in terms of the y 
field.) 

We insert the above expression between two infinitesimally close position 
eigenstates in (6.2.12): 


Nijow= (x, | en le) 
= (x|x2) — i (x1| Hdt |x2)+--- 


- i Dvn (xi|v1) (Walee) expl—Vi (2 — Wi) — VENI] 


ay if D?yiosalxil vn) (Wal Ht |Ws) (Wala) 


x exp[—Wi (Wn — Wi) — Wh — 3 (Wa — Ws) — WG 3] 
“eee (6.2.18) 


where the subscript 12 or 1234 simply means the product of two or four of 
these functional differentials. We now take the limit of small time separations: 


3 Vi Wisi — Vi) > ‘in ww dt. (6.2.19) 
i=l 
After taking the limit, we find 
Aw = ‘| D? wy (x )W* (xy) exp |! fava = Hw (6.2.20) 
Thus, our new Lagrangian is 
La [i> — al w, (62:21) 


. which is the Lagrangian for the Schrodinger wave equation. Therefore we have 
derived the Schrédinger equation, given the postulates of quantum mechanics 
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and the classical first quantized formalism for a point particle, without making 
use of the equations of motion or Huygen’s principle. 

The main reason why we went through this analysis for the point particle is 
that we will now repeat the same steps for the light cone and the BRST field 
theories. Surprisingly enough, we will find that this entire functional apparatus 
carries over directly into the string field theory for the light cone and the BRST 
actions. 


6.3 Light Cone Field Theory 


As in the point particle case, we now want to make the transition from the first 
quantized to the second quantized string formalism, using (1.8.21): 


(X|@) = P(X). (6.3.1) 


We must be careful to state that the field functional is not a local function of 
X(o+) at a specific point o on a string. By contrast, it is actually a mu/tilocal 
functional defined at all points along the string. If we discretize the string into 
a series of points: 


(Alm (2). O3,.--,O0Nn, (6.3.2) 
then the field functional becomes 
O(X) = O[X; (01), Xi(02), Xi(03), ..., Xi(on)] (6.3.3) 


and we take the limit N — oo. Thus the string functional is simultaneously a 
function of every point along the string [1]. 

Let us begin by defining the Hilbert space of string excitations. It is conve- 
nient to power expand the field |®) in terms of harmonic oscillators, in which 
case we have 


|b) = o(x) 0) + Aja! |0) + hijal_,a’,|0) +-->. (6.3.4) 


We sce immediately the difference between the first and second quantized 
formalism even at the free level. In the first quantized formalism the ba- 
sic object is X,, which represents just one possible configuration of the 
string. In the second quantized formalism we are dealing with the field 
functional ®, which is simultaneously a composite of all possible string 
configurations. 

To make things concrete, let us now introduce a specific representation of the 
|X’) cigenstates in terms of harmonic oscillators. We want the string variable 
X to act on the string eigenvector |X) such that we reproduce (2.2.9): 


Kin IX) = 5 (in ~ Ginn) IX). (6.3.5) 
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For the moment, let us make the assumption that the eigenvector |X) can be 
written as 


|X) = Q|0) 
=k] | exp(ax?, + bX,,a/,, + ca},a/,,)|0), (6.3.6) 


in *i,n 


where a, b, c, and k are arbitrary constants. 
Let us determine these constants a, b, and c by acting on this state vector 
by the operator X: 


Xin |X) = QQ iin — a} ,,)|0) 
= 2(11(bXin + 2cat, — at) 0). (6.3.7) 
Thus, we find 
b= —2i, 
C= (6.3.8) 


Second, we want the operator dy to have the correct commutation relations 
with X. If we operate on the state vector, we find 


|X) = (2aX;.n + baj,,) |X). (6.3.9) 
OMG n : 
Because we want 
rs 
= |X) = —i(ain + 4;,,) |X) (6.3.10) 
this fixes the coefficient to be a = —1. Our final result for the state vector is 
therefore 
IX) = kT exp (—X?, — 2iXinal, + 447,47) 10) (6.3.11) 


where k is a normalization constant. This expansion, in turn, allows us to 
calculate the field function as a power expansion in Hermite polynomials. For 
example, we can calculate 


(01X) =k] [e*, 


in 
2 


(0| din [X) = k(—2i Xin) | [e%. (6.3.12) 
jvm 
This, in turn, allows us to rewrite the original field functional (6.3.4) in terms 
of Hermite polynomials, using (6.3.11): 
(X|) = w(x) [| Ao(Xi.n) 
+ Ai(x)Hy(Xi1) | | Ho( Xin) +° (6.3.13) 


in 


n#l 
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(where we have taken a different normalization of the Hermite polynomials 
than usual). 

Let us now quantize this field functional, following closely the steps used 
in ordinary point particle field theory in (1.8.9) to (1.8.12). In general, we can 
power expand the field function in an arbitrary series of orthogonal polynomi- 
als. Let us choose an arbitrary element of the Fock space, which is a product 
of creation operators raised to some power. Let the set of integers nj denote 
the number of creation oscillators in a particular state with Lorentz index i and 
level number /. Thus, an arbitrary state of the Hilbert space can be written as 


n}) =e] [(@)" 10), (6.3.14) 
il 


where this state is a product of an arbitrary sequence of creation oscillators, 
and we choose the normalization constant to be 


6 =a) 7 (6.3.15) 
such that 
({n}|{m}) = din pny. (6.3.16) 


Thus, these states can be normalized so that they form an orthonormal basis. 
In fact, the basis is complete 


L= >)" |{n}) ({n}1. (6.3.17) 
{n) 


The matrix element of |{n}) in (6.3.14) with the string eigenstate |X) in 
(6.3.11) is just a product of Hermite polynomials: 


(X|{n}) = Hin(X)e7 Lew Xin, (6.3.18) 


Let us now explore some useful properties of |{n}). If we power expand our 
field functional in terms of this orthonormal basis, we can generalize (1.8.12): 


= 2 om I{n}) (6.3.19) 
{n} 
to the following 
(X|®) = 2 Pin) Ain (X)e~ Din Ni 
(n} 
The inner product between two such fields functionals is easily calculated 
(Pld) = Do vi din. (6.3.20) 
7} 


We can also show 


S(n},(m) = f Ps41m exp (- >i) D’¢, (6.3.21) 
{n} 
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where the measure of integration for this space is given by 
Dg = | [ dbin ddj,. (6.3.22) 
{n} 


Using these identities, we can now show that the number | can be written in a 
form analogous to (6.2.13): 


l= jh |b) D’® (®| exp(—(®|®)). (6.3.23) 


The above equation can be proved by power expanding each of the various 
field functionals via (6.3.19) and explicitly performing the integration over the 
coefficients. Thus, 


= [5 f em tomy I{n »| [[@ Pim} [> sin i} el?) 
{n} 


{m} 


D2 Fim Hrd) (LPH 6%, | A btmye Dia ar? 


{n}.{p}, {mm} 
= H{0}) ({P}] iny.¢p)- (6.3.24) 
{n},{p} 


Hence, we are justified in taking this expansion of the number | given by 
(G3e23), 

The matrix element between two string states |X) and | Y) can also be written 
in terms of |®) 


24 
=[[ [] sx%@-¥e) 


i=l 0<o<z 
= (xi) [ D'o(@|Y) exp (-\12) [ ozizie)) 


= / D*©0*(X)@(Y )e7 f PZ HZ), (6.3.25) 


Finally, we need to generalize (6.2.17) before deriving the Schrédinger 
equation for strings 


es il [Oy DD Oyen ts Ae |. (6.3.26) 


As before, this identity is most easily proved by first functionally integrating 
over P;. 

Now that we have all the identities in place, let us first consider the matrix 
element between two string states that are infinitesimally close 


(arem a 1X>). (6.3.27) 


.We can reexpress this matrix element either in terms of the first quantized pic- 
ture, where we insert acomplete set of momentum eigenstates, or in terms of the 
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second quantized picture, where we insert a complete set of field functionals. 
Thus, 


first quantization: [= |P) ‘| DEP |; 
. (6.3.28) 
second quantization: 1 = |®) | D? de~'*!®) (BI. 


Retracing our steps in Section 1.3, we inserted the momentum eigenstates 
and then derived the infinitesimal Green’s function for the first quantized 
formalism: 


(Xf eH" 1X5) = (Xi]P) far (P| ei##* |X.) 
-| —, 


Ore ee ee (6.3.29) 


where we used the fact that (P|X) ~ e'/¢7"*" Then we inserted an infinite 
number of these first quantized intermediate states between every infinitesimal 
interval between the initial and final points: 


xX 


(Xq| ee BX) = ( DX DPei Sa 4t40(PX-H) 
x, 
Xn ee 
< || DXDPeé i 4428 (6.3.30) 


X| 


where L = PX — H. 

Thus, we could go back and forth between Hamiltonian and Lagrangian 
formalisms in the first quantized string theory. 

Now let us repeat all our steps, inserting a complete set of second quantized 
field functionals into our action. Let us start with an infinitesimal transition 
amplitude 


Bin =X em aes) 
= (X,|X2) —i (X,| Hdr|X2)+--- 


= {| D?®12(X)|®1)(®2|X2) exp{—(®1|@ — ©) — (© |,)} 


=i f DO iaulXi1) (©y| H dr |) (®41X>) 


x exp} — D (D141 — O)) — (Ol) b+---. (63.31) 


f=Ik3 
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As before, we now take the limit as 


YGlei1- 4) + f axle). 


(3 


Next, we insert this complete set of intermediate field states between all in- 
finitesimal intervals between the initial string configuration and the final string 
configuration. Then the matrix element becomes 


Aw = / ®*(X})®(Xy)D°® exp ji f aco, Se on}. (6.3.32) 
This is our main result for this section. Therefore, our second quantized action 
is [1] 

L= *(id, — H)®. (6.3.33) 


In summary, we have now shown the equivalence of the first and second quan- 
tized formalisms for the free light cone string theory. We have shown that we 
can write, using (6.3.28), the Green’s function in either the first or second 
quantized language 


, Xj Tj 
ae =i DX exp if arL(o)| 
X; TF 


Xj 
= [ Pee xox ji far | L(@)Dx |. (6.3.34) 
Xe 
where 
Lay ae / do(X? — x7}, — (6.3.35) 
Dar 0 


and where 
cf" Pee. 
a= a do G + *) ; (6.3.36) 
2. 0 be 


We derived this second quantized field theory action from the requirement that 
it reproduces the Green’s function for the propagation of a string. This shows 
the strong parallel between first and second quantization for the case of free 
strings. (This parallel, however, will be drastically broken when we discuss the 


interactions.) 
Let us now quantize our action. The equation of motion derived from our 


action is 
(id, — H)®(X) = 0. (6:3°37) 
The energy that corresponds to a particular basis state is given by 


H |{n}) = Ejay |{n}), 
| i [Dap 
ca oe Bie + a'p?. (6.3.38) 
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We will find it convenient to take the Fourier transform of these coefficients 
with respect to x~: 


piny(x*, x, Xi) > bp+ tn (T, Xi). (6.3.39) 


Thus, the coefficients must satisfy the equations of motion 
Pp+,{ny(T, X;) = [aries te (6.3.40) 


Notice that we have introduced a new operator A; which is the creation or 
annihilation operator associated with the state {n}. It is crucial to notice that A 
is not the same thing as a! introduced in Chapter 1. a; creates a single vibratory 
mode of the string, while A creates or destroys an element of the Hilbert space 
spanned by all possible products of a’. A creates or destroys states defined on 
an infinite-component field theory. It satisfies the commutation relations: 


[Ap+. p,.tn}> Ara = 5(p* a q* )8(p; = Gi )5{n),{m)}- (6.3.41) 


Combining everything, we now have an expansion of the field functional in 
terms of plane waves: 


=> f Ten p*.pi.{n} gene nal) (6.3.42) 
{n} 


We can also express this power expansion in the X basis (see (1.8.21)): 


(X|®) = ®)+(X) 
=D f Teva poprtn Him (X)e7 De Me etPA- Eo dy | (6.3.43) 


We are now in a position to derive the canonical commutation relations in 
the second quantized string field theory. Because we have power expanded the 
field in terms of orthonormal polynomials, it is easy to show that [1] 


[p(X 1), ®).(¥. 7] = 6(p* — g*) | JP] 8X) — Yo). (6.3.44) 


Let us denote the vacuum of the A oscillators by |0)). Notice that this vacuum 
state is the product of the vacua of a// the higher spin fields contained within 
®. This state is the vacuum of the infinite-component field theory and hence 
has nothing to do with |0). Using the previous identities, find an expression 
for the Green’s function in terms of the field functionals: 


Ai2 = ((0| Pp+(X1, T1) PG, (X2, Tt) [0)) 
= apt — 4°) f Dxe'! 4" T] a(X(0, 1) - Xi(0)) 


x ] [sxe ™) — Kote); (6.3.45) 
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where the initial and final states are labeled by X(o, t;) and X (o’, T). It is 
important to notice that we have now expressed the Green’s function for the 
propagation of a free string in both the first and second quantized language. 
Thus, at least at the free level, we can go back and forth between these two for- 
malisms. We now have the tools with which we can write an explicit expression 
for the Green’s function for the free string, which generalizes the expression 
(6.2.4) for point particles: 


hoe 
ee a aes 
4 TL) Fy sinh(T/a) 


« (an) ee |-a - x9 


Pe KE kT 
x eXp {é sinh”! = cost res + X2) — 2X, |]. (6.3.46) 


where 7 is the time interval. Notice that now we have a derivation of the 
transition function A;; entirely in terms of second quantized field functionals. 
In summary, we have derived the free light cone string field action by di- 
rectly inserting a complete set of intermediate string states at every string 
configuration between the initial and final string configurations. 


6.4 Interactions 


Now we turn our attention to the question of interactions. Our derivation of the 
free action from the first quantized theory can now be generalized to calculate 
the interactions of the second quantized field theory. Again, we will insert the 
important identity 


1 =|) / D? de~'*'®) (|, (6.4.1) 


within the path integral in order to extract the vertex function. 

Historically, many of the early pioneers in quantum physics, such as Heisen- 
berg and Yukawa, looked into the question of nonlocal field theories and found 
that such theories violated causality, i.e., interactions could propagate faster 
than the speed of light. The nonlocal interactions, which can involve two dis- 
tant points x; and x, could transmit information faster than the speed of light, 
which is forbidden. 

Miraculously, string field theory solves this problem. The solution is simple 
but elegant: string field theory does not violate causality and the laws of quan- 
tum mechanics because it is not a nonlocal theory, it is actually a multilocal 
' theory. The interactions of the string, in which strings can break or reform, are 
such that strings break or reform instantaneously and then the vibrations travel 
down the string at or below light speed. Thus, we do not have a violation of 
causality. 
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C=C 


FIGURE 6.2. The five interactions of the light cone field theory. Open and closed 
strings can break, fission, and pinch. Notice that each interaction takes place locally 
along the string. This is the solution to the problem of causality, which is broken in 
all nonlocal point particle theories. Thus, string field theory is the only known field 
theory based on extended objects that preserves causality. 


Because all the symmetries have been extracted out in the light cone gauge. 
there is no overall guiding principle for the theory. We will, therefore, simply 
postulate the following principle: 


The only interacting string configurations that are allowed in the 
action are those that instantaneously change the local topology of 
strings. 


Although this principle is defined only in the light cone gauge. we will find 
that it is sufficient to determine all the possible interactions of the field theory. 
There are only five such local interactions (see Fig. 6.2) that are consistent with 
this new definition of locality and also the conservation of momentum. (We will 
choose the parametrization length of the light cone string to be proportional 
to p*, that is, ~ = 2p*, and hence momentum conservation implies that the 
sum over string lengths is conserved.) Thus, we postulate 


5 
cae ae ie (6.4.2) 
=) 


where each term corresponds to a specific interaction for an open string field 
® and a closed string ficld Y. These five interactions, symbolically speaking, 
can be represented as 


L,=P4+O4W4 OV + OV, (6.4.3) 


We will write explicitly what some of these interactions are, consistent with 
the conditions of locality. Once we have written specific representations for 
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these five interactions, we must check that they reproduce the known results 
from the first quantized theory. 

The simplest interaction involves the breaking of a string into two smaller 
pieces. To enforce the condition of locality, the string can break only at a single 
interior point along the string. The disturbances from this rupture should later 
spread along the string at velocities less than or equal to the speed of light. 
Thus, we are led to postulate that the points along the string are continuous 
across the boundary of the interactions. The unique form of the vertex function, 
consistent with momentum conservation and locality, is a series of Dirac delta 
functions that ensure continuity of the three strings. Our vertex is [1] 


3 
$3 = il dp'5 (» P| i DX 123 *(X3)1(X,)P(X7)5y23 + hic., (6.4.4) 
r=) 


where 
S103 = | | 81X3(03) — Ora — 0) X4(01) — O(0 — a) X2(%r)], 


DX 3 = DX, DX2DxXs3, (6.4.5) 
(see Fig. 6.3). We will use the notation 
0<o <2(a, +a), 
jo tor Oo = 710), 
0.=a0—7na, for ma; <o <(a,;+aQ)), (6.4.6) 
C—O) one for 0 =o ~ 7(e; +a), 


3 
da =0. 
i 


where the parametrization length of each string is given by 1a; and where the 
X variable represents only the transverse modes of the string. 

Remarkably, we can actually perform the DX integration in the above vertex 
function because it is a simple Gaussian. Let us define 


@,(X) = (X|®;). (6.4.7) 


o=n1(a, + a) 


FIGURE 6.3. Parametrization of the three-string vertex. The parametrization length 
of each string is given by za@,. The sum of all three 2a, is equal to zero. 
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Then we can rewrite (6.4.4) 


3 
3 = favs ( P| (D1 | (D2| (®3|Vi23), (6.4.8) 


i=) 


where we have introduced the vertex function 
|Vi23) = / DX \23 |X1) |X2) |X3) 8123. (6.4.9) 


Since the eigenvector |X) in (6.3.11) is a simple Gaussian in X, we can ex- 
plicitly calculate the integration over DX 23 and obtain an exact formula for 
the vertex function in terms of harmonic oscillators. 

The actual calculation will be more conveniently carried out in the P rep- 
resentation. By taking Fourier transforms, we can easily convert from X 
eigenstates in (6.3.11) to P eigenstates: 


|P) =k [exp (—1Pi + Pinal, — a},a),) (0). (6.4.10) 
We easily check that this expression reproduces the correct eigenvector 
equation (2.2.9): 
Pin |P) = Gin +4} ,)|P)- (6.4.11) 
Let us write explicit Fourier decompositions for all three string states in the 
vertex, in both the x and P representations: 


| (6.4.12) 
XO =| xis) Se cos — 
( i Bs n ie n c " 
where 
0, = O(a, — 0), 
6, = O(o — 14a@)), (6.4.13) 


65 =0,+6,=1. 


The integral that we would like to perform is given by 


3 
| Vi23) = f DPas [TIP bu 
i 
= / D P,236 oe 


ral 


3 
* [i] See, + PMA” — Lala) 10). (6.4.14) 


trae in Tin 
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The integral is simple and straightforward because it is a Gaussian. The only 
complication that we will face is the explicit form of the delta functional ;73 in 
(6.4.5) written in terms of harmonic oscillators. Let us now extract the Fourier 
components of the delta functional 5,3; in order to perform the functional 
integration. Let us take the cosine transform of the constraint equation among 
the various P’s. Ordinarily, cosine and sine transforms yield delta functions. 
Here, because the three strings all have different lengths, we find that the cosine 
and sine transforms yield matrix equations, not delta functions. In particular, 
the statement that 


3 
D5 9G, )P; (0) = POP(a3) + Pon) + P!(o1)0, =0 (6.4.15) 


E=l 


now becomes, when we take the cosine transform, 
co 
(3) (1) 2) 1 2 
p+ (AM pO + AD p+ BO p + Bp —0, (6.4.16) 
n=! 


where the A’s and B’s are the various overlap integrals between the cosine 
functions. Instead of enforcing this identity as an exact equation, this operator 
equation need only be satisfied when acting on the vertex function. We want 
the vertex function |V}23) to vanish when acted on by the 6)23. This means [1, 
6, 7] 


3 lee) 
(>> > ADA? +a) + pr) \Viza) = 0, 


r=! n=1 

(6.4.17) 

a co 

(1 tal, — — YC ACO) malay? + On |Vi2s) = 0, 
3 n=) 
where 
(C)inn = Momn, 

; P = 2pi Pp) — 277 P'. (6.4.18) 


Thus, the Fourier coefficients are given by taking all Fourier transforms of 
the various cosine modes. Because the three strings all have different lengths, 
we will in general have nontrivial Fourier coefficients. By explicit construction, 
we have the following Fourier coefficients: 


~ 1 Way 
A = 2(n/m)'/2(—1)"— | gee ie 
TA) Jo oa 03 
BaP) (6.4.19) 
n? — mp? 


ME (Go =o) Mo: 
cos —————— cos — 


= —(2/m)(mn)'?(-1)"" 


1 
A®) — 2in/my7(—1yn— | 
TO 
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ae) (6.4.20) 


= Ci Kenny Yr aeeainek 


Ao) = Ona 
il TEA] 
B= mye ey / cos “do 
TA) Jo 3 
= 2a3(1a)7'(—1)"(m)~*” sin(mxB), (6.4.21) 
1 7 {or +02) 
B®) = 2(m)'?(—1)" — i cos do 
IT 2 ray 3 
= —2a3(maz)'(—1)"(m) >” sin(mxB), (6.4.22) 


where 8 = a /03, BY = —o2B,, and B? = a By. 

Although the expressions may seem complicated, the actual calculation of 
the integral is simple because it is only a Gaussian integral. By performing 
the integral, we find an explicit form for the vertex function. We first take 
the integral over P), which is trivial because the delta function forces it to 
be a combination of the momenta for the other two strings. The functional 
integration over P‘!-*) is also easy, because it is also a Gaussian. When we 
combine terms, we find the compact result for the vertex function in terms of 
harmonic oscillators [1, 6]: 


] 3 le, @) ; 
|Vi23) = exp 5 » 3 a!) NTS a? 


ms—lintn—1 
3 oo 
+ So No) + KP? } 0), (6.4.23) 
eh | 
where 
Ninn = (Conn des — Amn) (ACT LE“! AM) (6.4.24) 
and 


Ni = —(m)'?(ACTD "Bn, 
K =—iBr'B, 
P = ap) — a7pi, (6.4.25) 


3 
C= pe A Ar 
|| 


Let us now summarize what we have done. We have postulated that the only 
interactions between strings are those that are local; i.c., only instantaneous 
local deformations of the topology of strings are allowed. In this way, we 
avoided problems with the violation of causality, which historically plagued 
attempts over the decades to create nonlocal field theories. What is surprising 
is that this principle alone is sufficient to determine all five string interactions 
in the light cone gauge. 
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Notice that the Neumann matrices N’*, are all determined uniquely from 
the overlap conditions (6.4.15). Locality and the conservation of momentum 
alone are sufficient to determine the precise oscillator representation of the 
vertex. 

Next, we wish to check that this reproduces the usual Veneziano formula. 
There are two things that have to be checked: 


(1) We must show that we reproduce the Neumann function for the Veneziano 
model. 

(2) We must show that the Jacobian for the transformation of coordinates from 
the light cone configuration t to the usual upper half z-plane is the correct 
one. 


We will now show that we reproduce the usual string amplitudes, demon- 
strating the equivalence of the first and second quantized interacting theories 
at the level of perturbation theory. 


6.5 Neumann Function Method 


In Chapter 2 we saw that we could perform the functional! integral over a disk 
with L handles: 


Ay = | duMy exp | hin ki] (6.5.1) 
L YT 


i>j 


where N(i, j) is the Neumann function between the points i and j on the rim 
of the disk or upper half-plane with handles, and My, includes all measure 
terms. 

The disadvantage of this approach, however, is that the functional integral 
is defined on the disk or the upper half-plane. The string interpretation of 
the conformal disk is obscure. To make the connection between the string 
approach and the conformal disk, let us first make the conformal transfor- 
mation » = Inz. This conformal map takes the upper half z-plane and 
maps it into a horizontal strip in the o-plane that has width z. This hori- 
zontal strip, in turn, can be interpreted as the surface swept out by a single 
free string. What we want is a generalization of this map to the N-point 
function. 

We follow Mandelstam [8] and make the following conformal transforma- 
tion on the upper half-plane: 


p= >a log(z — zi), 4 = 0, (6.5.2) 


where the parametrization length of each string is 7a. This conformal map 
stretches the upper half-plane into long horizontal strips, which correspond to 
the motion of splitting strings. 
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Analytically, we can see qualitatively that this transformation maps the upper 
half-plane into the proper light cone diagram. Let us start our variable z at 
positive infinity on the real axis and then slowly move to the left. As we 
approach one of the singularities on the real axis, p moves rapidly to negative 
infinity on the real axis. But we hop over one of the singularities z;, and then 
the logarithm picks up an imaginary part: 


p ~ a; log(e'”z) = ima; + a; log z. (6.5.3) 


Thus, in o-plane, we jump aq; vertically upward. This corresponds to moving 
from the bottom of a string of length 2@; to the top of the string at infinity. As 
we keep on moving in the z-plane to the left, the point in the p-plane begins 
to move to the right (displaced a distance 7a; upward). 

As we continue to move to the next singularity at z;_; on the real axis, 
something strange happens. At a certain point, the p variable can slow down, 
stop, reverse direction, and begin to move left in the complex plane to negative 
infinity. This turning point in the p-plane can be calculated by taking the 
derivative 


d 
turning point: - = (6.5.4) 
"6 


and solving for z. As we hit z;_; in the z-plane, the point in the p-plane moves 
to negative infinity. As we hop over the z;_; point, we move another 7a, _ | 
vertically upward in the p-plane. In this fashion, by carefully moving on the 
real axis in the z-plane, we sweep out the light cone configuration shown in 
Fig. 6.4. 

Fortunately, we know from (2.5.7) that the Neumann function in the upper 
half z-plane is the sum of two logarithms. Next, we want to power expand this 
same Neumann function in terms of variables defined on the o-plane. For the 
free string, for example, we can take coordinates & for the t direction and 7 
for the o direction. Then 


FIGURE 6.4. Parametrization of the N-point function. The incoming (outgoing) 
legs have positive (negative) parametrization lengths. By changing the lengths of the 
interior horizontal lines, we create the various field theory graphs that sum up to a 
dual amplitude. 
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It is now a straightforward process to power expand the Neumann function in 
terms of & and n: 


N(z, 2) = Injeé — e' | + Injeo — e°'| 


> 2 —nlé—€' | / 
=— ) ae cosnncosnn’ + 2max(é,é&'). (6.5.5) 
n=] 


The proof that this expansion reproduces the Neumann function is rather sim- 
ple. First, we notice that the function cosnn has zero o derivative at the 
endpoint, which is consistent with the fact that X’ = 0 at the endpoint of 
a string. Second, by acting on the power expansion by the operator V7, we can 
show that it vanishes except at the point z = z’. Then we can show that the term 
2 max(é, €'), which selects the maximum of the two, is the correct expression 
when we take € —> +oo. By uniqueness, this expression must therefore be 
correct. 

Solving for the Neumann function for the interacting string is done in the 
same way. First, we make an educated guess that the Neumann function is 
given by [8] 

[o.e} 


2 
N(p, p') = —4ys (=) cosnn, cosnn, exp[—nlé, — &/1] 


n=l 


at ye 2N/*, cosmn, cosnn’, + 26,; max(&, &’) 


— 2735,3 — 273553 + bys, (6.5.6) 


where the prime means to delete the n = 0, m = 0 term, and where we can 
define local coordinates directly on the rth string: 


p=a,(é, + in,) + const. (65:7) 


for r = 1, 2,3, and where the constant can be chosen so that the coordinates 
start at the nearest turning point. The proof that this expansion is correct 1s 
also carried out in the same fashion. Notice that the first two terms on the 
right-hand side of the equation are simply the terms coming from the free 
string, which guarantees that the equation V7N = 275°(z — z’) is satisfied. 
Second, we note that the rest of the function is nothing but a power expansion 
in terms of cos nn, which solves Laplace’s equations. Thus, since the Neumann 
function is unique and the expression solves Poisson’s equations, it must be 
correct. 

It is important to notice that we have done nothing new at this point. We 
have simply written the Neumann function as a Fourier expansion in the various 
string coordinates. The crucial factor in the above expression is N’*,, which 
will appear directly in the three-string vertex. These coefficients, in turn, can 
be explicitly calculated because we know the form of the Neumann function 
in the upper half-plane, which is the sum of two logarithms. Thus, we can take 

’ the various Fourier transforms of this known Neumann function, defined over 
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the upper half-plane, and extract out the Fourier coefficients Nm. We reverse 
the previous equation and now solve for the Neumann coefficients in terms 
of the Neumann function. If we take the Fourier transform of the Neumann 
expansion (6.5.5), we arrive at [6] 


if dz 1 Be) 
(ae Meare ee NCENZ n > 0 : 
Oh ee ne ( ) 
emer(z r)—n¢;(zs) 
NES ee ade or ; (6.5% 
ee 5 p f —z;) 
(x; = 1,x. = 0,x3 = oc). If we insert the Neumann function defined 


over the upper half-plane into the previous equation and make the proper 
change of variables, then we have an explicit formula for the Neumann 
coefficients. 

Now that we have an explicit form of the Fourier moments of the Neu- 
mann function defined over the light cone diagram, let us consider the case 
of three strings. The transformation of the upper half-plane to the three-string 
configuration is simply 


ep = ay, In(z — 1) + a2 Inz. (6.5.9) 
The turning point for this mapping can be calculated by taking the derivative 

d 2 

ee ee (6.5.10) 

dz 3 


Thus, the time at which the string splits is equal to 


3 
t = Re p(z = %) = ) a, In|a,|. (6.5.11) 


rl 


What is remarkable is that the mapping (6.5.9) can actually be inverted, solving 
for z in terms of p. This means that we can explicitly calculate all Neumann 
functions, rather than appealing to abstract expressions such as (6.5.8). 

We begin our discussion of calculating the Neumann coefficients by care- 
fully placing coordinates on the various three strings. For example, let us define 
the coordinate ¢3 on the third string: 


=o +in =b&+in. (6.5.12) 


O13 


By dividing out by a3, we guarantee that the 7 parameter ranges only from 0 
to z. In terms of these new variables, the mapping (6.5.9) can be written as 


~ine— a bin 21 n (1 - 2) 


9) ind + eben estinainzy, (6.5.13) 
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If we introduce new variables: 


oy hg te ene, (6.5.14) 
x=e®, (6.5.15) 
= 6.5.16 
= oe (6.5.16) 


then the map (6.5.9) reduces to 
y=y In(1 + xe’). (6.5.17) 


Fortunately, this equation can be solved explicitly by a power expansion of the 
form ~ 


=) vag (6.5.18) 


n=1 


Inserting (6.5.18) into (6.5.17), we find an explicit expression for a,: 


1 
i aly —1)---(ny —n +1). (6.5.19) 
Inserting back our expressions for z and ¢3 into (6.5.13), we now find 
co 
Inz = —¢3 +im + > ll (-=) es}, (6.5.20) 
P= 3 


Now compare this equation to (6.5.6), letting the variable z’ go to zero. In that 
limit, we have 


In |z| = —& + > N25 cosnn;. (6.5.21) 


n= 


Comparing these last two expressions, we see that N;> is proportional to a,. 
The same analysis can be carried out to the other strings, and eventually we 
can calculate all the Neumann coefficients in terms of a,,. The final result is [8] 


~ 


Nis, = —————-m 0903", N*, 
, ma3 + na, 
Ni, =a," fin(—o,+41/a,) exp(mty/a,), (6.5.22) 
where y, = —a@,+,/a, and 


An 


fr(¥r) = (ny,)! a 


Ke — 7 / (2001203), 
3 

= yea: In |e, |, (6.5.23) 
f= 


P = ap} — ap}. 
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At first, we see a vast difference between the functions derived from the 
second quantized action in (6.4.24) and the Neumann functions in (6.5.22) 
derived from conformal maps. The first version of the NV matrix is defined in 
terms of products of inverses of overlap integrals. The second version of the 
N matrices is in terms of Neumann functions on a Riemann surface. 

Miraculously, however, we can show that they are the same [6, 9] 


NvS = NS 
Nee 


Thus, although the two derivations seemed at first totally dissimilar, we wind 
up with an exact equivalence. (The actual proof that the Neumann coefficients 
found in the field theory (6.4.24) and the coefficients found in the first quantized 
theory (6.5.22) are the same can proceed in two ways. First, we can use the well- 
known theorem that the solution to Poisson’s equation with known boundary 
conditions is unique. We can show that both Neumann functions are solutions to 
the Poisson’s equation and are continuous across the tT = 0 configuration when 
the strings split. Thus, although the two expressions appear vastly different, 
they are actually the same. There is a second proof, however, which relies on 
brute force to show the equivalence of these two expressions. The proofrequires 
that we manipulate complicated expressions through the identity (6.5.8) [6]. 
Details of this calculation, however, are very tedious and will not presented.) 

The next step is to prove that the Veneziano model is reproduced by the light 
cone theory. 


6.6 Equivalence of the Scattering Amplitudes 


Let us begin our discussion by writing the N-point amplitude in the light cone 
formalism. The amplitude is a straightforward generalization of the amplitude 
found in the light cone point particle field theory [8]: 


N-1 
where 
W= [det Ie | [@)'e DIMA oats 


l ? ; , , 7 " 
x exp (| yf ao'ao" Po No wee <ee' 6 } (6.6.2) 


where b'") represents an incoming or outgoing state vector of the rth external 
string, t; represents the interaction times, P~ represents the Fourier transform 
component necessary to convert the expression to the on-shell amplitude, and 
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P,(s) represents momentum distribution smeared out over the string (which 
represents an arbitrary collection of higher resonances). 
Notice that there are two immediate problems with this amplitude: 


(1) The Neumann function occurs only with the transverse momentum exci- 
tations. We must show that the complete expression is Lorentz invariant, 
including the longitudinal momentum factors. 

(2) The interaction time variables 1; found in the string field theory must 
be transformed into the usual Koba—Nielsen variables z;. This requires a 
Jacobian, which in general is quite complicated. 


Let us resolve the first problem, the seemingly nonrelativistic form of the am- 
plitude, with the transverse and longitudinal factors occurring in qualitatively 
different fashion. However, this is an illusion. There is a trick that converts 
this expression into a Lorentz-invariant integral. Notice that the t variable is 
a solution of Laplace’s equation. Thus, because it has the correct boundary 
conditions, we can write it as a line integral over each external leg at infinity 


[8]: 
1 
I= = Y | aa'nte E04): (6.6.3) 


(This is a rather strange identity, which expresses T in terms of expressions 
containing itself. To prove this equation, simply multiply both sides of (2.5.6) 
by t’ and perform integrals over the two-dimensional space. By carefully elim- 
inating terms by integration by parts, we find the above equation.) Thus, we 
can replace the t variable with its Neumann function and write 


1 
eee =— ee foo N(o, 7,30", 1) 


ee 
aa Ps. [de | donee, t30',%). (6.6.4) 
54 Ol, Ol 


By adding this longitudinal contribution to the Neumann function containing 
P~ p* with the transverse contribution containing p;p;, we wind up with a 
totally Lorentz-invariant Neumann function. Thus, the terms of the form P~T 
are just the missing pieces necessary to covariantize the terms p; pj. 

Thus, the first problem of establishing the Lorentz invariance of the integrand 
was a trivial consequence of the uniqueness theorem in electrostatics. The 
second problem, converting the interaction times t; into the Koba—Nielsen 
variables, is also a straightforward problem which can be solved using the 
uniqueness theorem. 

We wish to calculate the Jacobian 


Peder E (6.6.5) 


xj 


where x; are the usual Veneziano variables on the real axis. 
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Let us choose the frame x; = 0, xy_| = 1, and xy = oo. Now compare this 
Jacobian with the factor 
J eas (6.6.6) 
where 
a*p 
Ci = Boa lean 
p = >_ a; In(x; — 2). (6.6.7) 


Notice that the Jacobian and the factor involving the derivatives of the 
transformation have the same analytic structure. They both have the same 
singularities when the various x; touch and have the same boundary condi- 
tions. Thus, they must be the same, up to numerical factors! To calculate this 
overall constant, we are free to take the x; spaced widely apart, so that 


pia tnx; > on, (6.6.8) 


S<i 


so that 


(6.6.9) 


(In addition, there are also complicated terms which include the determinant 
of the Laplacian defined over the four-point configuration and the zero—zero 
components of the Neumann functions. These terms cancel among each other 
if we carefully look at their analytic structure and their singularities. The first 
demonstration of their explicit cancellation was performed in [6] for the four- 
point function. The cancellation for the arbitrary case, including loops, was 
performed in [12].) Thus, we have reduced the Jacobian to a trivial factor [10— 
12], the product of the various Koba—Nielsen variables. Putting everything 
together, we have 


Ay = fl [ [ve i du (6.6.10) 


] _ ! , " ” 
x | [exp (; Df dota" Pa ito .7e ae ke ) ‘ 
<y has 


If we take external tachyons instead of arbitrary external resonance states, we 
find 


Ay = ait =a (6.6.11) 


i<j 


as before. Thus, the N-point amplitude is recovered. 
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6.7 Four-String Interaction 


Our previous intuition led us to postulate the existence of a four-string interac- 
tion [1], where two strings can combine at their interiors and instantaneously 
change their local topology. At first, it doesn’t appear as if this diagram should 
exist in the Neumann function method, since the upper half-plane was al- 
ways mapped into a configuration that was planar. However, the fact that the 
four-string (and all the other) interaction terms are present in the Veneziano 
amplitude can be seen if we rigorously examine the region of integration of 
the Koba—Nielsen variables. 

We begin with a four-string mapping and set x; = 1, x. = 00, x; = 0, and 
X4 = x. Then the four-string mapping becomes 


p = @ In(z — 1) + @3 Inz + a In(z — x). (6.7.1) 


To find where the mapping is singular, let us set 


do 
—=0. ue 
= (6.7.2) 
Solving this equation yields the turning points of the transformation: 
1 
= ——{l — + Al}, i. 
Zt Te Ey! sey’) 30) } (6.7.3) 
where 
gs! 
——_ (6.7.4 
ae Ona Qt oe 


A =x (¥2—- Vi) + 2x2Qny2 -— Nn — 2) +1. 


Normally, the two solutions of the turning point equation, given by the + 
in the square root, show that there are two turning points on the Riemann 
sheet that have the freedom of moving past each other. In the s- and t-channel 
diagrams, the world sheets of the strings smoothly transform into each other. 
However, an interesting thing happens when the imaginary parts of the two 
interacting points in the p-plane coincide. 

For example, let us study the Feynman graphs for the t- and u-channel 
scattering of four particles (see Fig. 6.5). We sce that these two graphs cannot 
smoothly turn into each other. The strings meet either near the top or near the 
bottom of the diagram, so there is no way of continuously deforming these 
two diagrams into each other. But this is impossible. The conformal map, by 
definition, was a smooth one that allowed us to go continuously from the f to 
the u channel and vice versa. /n other words, a piece of the integration region 
is missing! To see this, let us calculate precisely when the t- and u-channel 
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FIGURE 6.5. The four-string interaction. It is impossible to continuously deform 
the t- and u-channel four-string scattering diagrams into each other if we use only 
three-string vertices. Since the conformal mapping is continuous, this means that 
there is a missing piece of the integration region, which can be supplied only by 
postulating a new four-string interaction. 


diagrams meet. We set A equal to zero and then solve for x: 


] » 
= ——_ (n+ — 2 +2 — Doe — 1 ea 
Vie 


Notice that there are two solutions that give us the r- and u-channel turning 
points. The region (x, < x < 00) gives us one diagram, and the region 
(x. > x > —oo) gives us the other diagram, but what about the region in 
between? 

This is the missing piece. It represents a continuous deformation from one 
channel graph into the other graph, such that the local topology of the two 
graphs is only instantaneously deformed into the other (see Fig. 6.6). 


FIGURE 6.6. Topology of string interactions. The equipotential lines we can draw 
on a disk with external charges are isomorphic to the interactions of the string. If 
we have like charges located on opposite sides of the disk, then the equipotential 
lines collide in the very center and rearrange their topology. This is the four-string 
interaction. Likewise, all five allowed interactions can easily be displayed in this 
fashion. 
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In this intermediate region, the local topology of the four strings changes, 
such that the strings reconnect in a different sequence. Thus, this is 
graphical proof that the four-string interaction that we postulated ear- 
lier is, indeed, part of the Veneziano formula (for the f- and u-channel 
graphs). Without this missing piece, the string field theory is actually in- 
complete and violates conformal invariance and other properties of the 
S-matrix. 

The four-string interaction, therefore, requires an extra integration over do. 
This is like a “zipper” that allows us to locally change the topology of the 
four strings. At first, we might suspect that this interaction takes place faster 
than the speed of light. After all, the four-string interaction term takes place 
instantaneously in time because the do integration happens instantly. We seem 
to be violating our postulate of locality, which we originally imposed to yield 
a causal theory. 

The resolution of this puzzle is that the four-string interaction is analogous 
to the Coulomb interaction term that arises in Yang—Mills theories when quan- 
tizing in the Coulomb or light cone gauge. In the Coulomb gauge, the field Ao 
appears quadratically in AgV* Ao (without any time derivatives) and linearly 
in the coupling to fermions Apwy°w. By functionally integrating out over Ay. 
we find 


wy wv wyew. 


which is the four-fermion Coulomb term. Notice that the operator V ~ is 
an instantaneous operator (1.e., has no time dependence). This term appar- 
ently violates relativity, but the S-matrix is actually strictly causal even 
in the presence of this term. It is an artifact of the gauge fixing, and 
any apparent violation of causality disappears when we calculate the full 
S-matrix. 

The apparent violation of special relativity is only an illusion. Thus. the 
integration do over the zipper in the four-string interaction is consistent with 
special relativity. 

Let us write down the four-string interaction term for the field theory ac- 
tion. We stress that the four-string interaction term can be guessed simply by 
postulating locality but is also consistent, as we have seen, with the Veneziano 
amplitude. The interaction term is [1] 


4 
See / DXivu f dps (>: 2) 85 08; NIE NI Ain 


red 
(6.7.6) 
where pz is a measure term and 


3)234 = T(x = > X (0% ] []3( xv oa Ss X(o.¥). 
j=l O; 
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where 
4 
7 = (0: 
i=l 
os 0. a4 > 0, 
Cra U0, a0: 
0; = Ma,— 04 if y < 04 < 104, (6.7.7) 
07 = 1\a| — 03 if0 < 03 < x, 
O72 = 1A; — 03 if x < 03 < 103, 
02 = H\a2| — 04 if0 < 04 < y, 
y=x+7(a% — |a)), 
and 
0; = O(o4 — y), 
62 = O(y — 04), 
6; = 6(03 — x), 
6, = 6(x — 03). 


Although the four-string interaction can be explicitly calculated, it appears 
hopeless to generalize this to the complete N-point amplitude. There appear 
to be hundreds of possible diagrams that must tediously be checked. However, 
there is a convenient trick that will reduce this problem enormously. 

First, we notice that if external electric charges q; are placed on a circular 
disk, then the equipotential lines in the disk are easily drawn. The key obser- 
vation, however, is that the conformal map takes these equipotential lines and 
maps them into vertical lines on the light cone diagram. Thus, the topology 
of equipotential lines must reproduce the precise topology of interacting open 
and closed strings. This remarkable observation reduces a seemingly hopeless 
task to a relatively simple problem, that of drawing equipotential lines for a 
disk with external charges. 

For example, we know that t = Re p(z) = >); a In|z — z;|. But we also 
know that the electrostatic potential in two dimensions at a point r produced 
by a series of point charges q; is given by 


potential = 2 q In|r —r;|. 


Notice that we can reinterpret t as the electrostatic potential created by point 
charges a;. Therefore, lines of equal potential are precisely lines of equal t. 
But string propagating along the Mandelstam strip are just vertical lines, which 
correspond to lines of equal t. Thus, the equipotential lines on any conformal 
surface generated by charges a; correspond to the physical evolution of an 
interacting string. 
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From Fig. 6.6 it is obvious that a four-string vertex must also exist in the 
light cone formalism. In fact, by analyzing equipotential diagrams, we can 
show that exactly five distinct interaction terms must be added into the path 
integral. Notice that closed string interactions emerge from the open string 
sector as “bound states.” We see that the interacting Lagrangian must be the 
sum of all these five distinct terms. 


6.8 Superstring Field Theory 


It is possible to express both the NS—R and the GS action as a light cone second 
quantized field theory [13]. New features arise when we do this: 


(1) The field functional ® is now a functional of the spinor fields as well, 
which can be represented as the 8 of SO(8) or the 4 + 4 of SO(4). 

(2) The theory possesses supersymmetric generators that map interacting 
bosonic terms of the action into interacting fermionic ones. This will place 
strong constraints on the possible interactions. 

(3) Unlike the bosonic theory, we have to add a specific insertion term at the 
point at which the strings break. Without this extra insertion term, the 
theory is neither supersymmetric nor Lorentz invariant. 


Let us discuss the GS action in light cone language, because the action is 
explicitly supersymmetric. Our first quantized light cone action is given in 
(3.8.3) 


—1 . ee 
Sic = Fay | Pe0.X'0,x" — (2ia')p*d*y~d,0°), (6.8.1) 
4a! 


where 6'? are spinors with eight components in SO(8) space and simulta- 
neously spinors with two components in two dimensions. Recall that in 10 
dimensions a Dirac spinor has 32 complex components, a Majorana spinor has 
32 real components, a Majorana—Wey]l spinor has 16 real components, and in 
the light cone gauge the spinor has eight real components transforming under 
SO(8). 

The problem with quantizing this action is that the fermion field is self- 
conjugate. We have the equation 


Thus: 
{9'“(c), 0'(o')} = {074*(c), 07°(a')} ~ 8% 8(o6 — 0), 
{O'" (ajree (e )) — 0: (6.8.3) 


Notice that this is nor in canonical form. These fields are self-conjugate. The 
fermionic fields form a Clifford algebra, while we would prefer to have Grass- 
mann states without the delta function on the right-hand side of (6.8.3). The 
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simplest way out of this problem is to divide up the eight states in the spinor 
into 4 + 4 states, with one set of four states being the conjugate of the other 
four states. One way to do this is to use the SU(4) subgroup of SO(8): 


SO(8) D SO(6) @ O(2) = SU(4) ® U(1). (6.8.4) 
Under this decomposition, the 8 of SO(8) decomposes into 
8=404. (6.8.5) 
Let us now decompose the eight components of the spinor into 
gia (a'4, 218), 
974 — (674, 928), (6.8.6) 
where A and B range from | to 4. We use the notation 
4: 94= Ba, 
4: JA =A,. (6.8.7) 


With these new definitions, we have the new independent variables 6A, which 
are all mutually anticommuting without any delta function as in (6.8.3). The 
desired anticommutation relations with the canonical conjugates are 

(0° 4(e), 14-8 (0')} = 8(0 — 084284, (6.8.8) 


Because there are so many stages in the reduction of a Dirac spinor with 32 
complex component, let us summarizes how we reached only four independent 
states: 
Dirac = 32 complex components, 
Majorana = 32 real components, 
Majorana—Weyl = 16 real components, 
light Cone = 8 real components, 
canonical = 4 real components. 

Now that we have decomposed the spinors according to the SU(4) subgroup 
of SO(8), we have to decompose the vectors under SU(4) as well. The vectors 
A’, 1 =1,2,..., 8, will be broken up as follows: 

A! B! = A‘B' + A’ BR + A*B*, (6.8.9) 
where 
AR = Dex af xs). 
Ab = 2-/(X7 — ix®), (6.8.10) 
Our final action for the free theory is given by 


S= / D'°z[a, Vd_W + Tr(a, b3_®)], (6.8.11) 


282 6. Light Cone Field Theory 


where the independent set of integration variables is given by 
p'tg= px! Digs (6.8.12) 
and where ® are open string fields and W are closed string fields. 


Our basic field functional is now given by (where we have placed isopin 
labels on the field): 


©” X (a), 6'(c), 02(o)] = —0”[X(ra — 0), 0°(1a — 0), O'(na — 0)], 
(6.8.13) 
where the o variable is purely symbolic but was added to show how the field 
transforms under a twist. 
Following (6.3.44) for the bosonic case, we can construct canonical 
quantization relations: 


[6(1), &4(2)) = sla + a){5%8"A'[z,(0) — 23(0)] 


— 8948" A!8[24(0) — z2( |a2| — 0) ]}, (6.8.14) 
where 
z = (X!,@!4, 924), (6.8.15) 


Now that we have established the free theory of superstrings, let us begin 
the difficult task of constructing the interacting vertices for superstrings. We 
will find several complications: 


(1) There will be two sets of oscillators, not one, and separate continuity 
conditions arising from the overlap delta function. Fortunately. the two 
sets of oscillators commute with each other and do not mix. 

(2) There will be extra terms defined at the joining point of the three strings. 
In general, extra fields cannot be introduced along the string because they 
will violate Lorentz invariance and conformal invariance. However. fields 
can be placed at the precise point where the string breaks. We must be 
careful, however, because of singularities that exist at that point. 

(3) The greatest restriction is supersymmetry, which will completely deter- 
mine the nature of these insertions at the breaking point. 


We begin our discussion of the vertex function by postulating the form that 
it will take. Based on an analogy with the bosonic case, we postulate that the 
superstring vertex must look like 


|V) = Z, exp[Ap + As] |0) a( Ya. al oe p! a( > a uy. (6.8.16) 


where the Z; fields are insertions at the breaking point and Ao is the usual 
bosonic term found in (6.4.23): 


3 ae 3 
Ay = ; SS Ra) + bok ae — (6.8.17) 
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where @ = @a@3. We guess, purely by analogy with the bosonic theory, that 
the fermionic part must be quadratic in the creation operators. Let us adopt the 
following decomposition: 


Fi 1 7 
gia = ys RA eine/ial 
/Jor = n ’ 
2A 1 A 5 
g24 — oe RA ew ina/la| 
eae n 2 
yiA = ye Rie ener 
a 27 |a| 
1 p 
Bi A ,—ina/|o| 
a ag ee 


where 


{Re R?} = Oman, 0a, 


As = 3 » ROA YTS ROA 4 ss y VEROA@A, (6.8.18) 


ia foil pe 


where the U and V matrices are totally unknown and 
2 1 = = 
@4 = —(6", —6%)). 
a3 


There is no justification for this form (6.8.18) other than that it satisfies 
the basic boundary conditions that we will now apply. We will enforce the 


conditions 
3 ©,04 2105 
wee (6.8.19) 


where ¢€, is +1(—1) if the string state is incoming (outgoing) and the tilde 
represents the second oscillator. These conditions, when written out in Fourier 
modes, resemble the conditions found for the conservation of momentum. 
Specifically, we generalize (6.4.17) 


SOO Z 
S75 Aton’ — ROM) IV) = 0, 


Ty ager ROA 4 ROA) _ SB, O44 |V) =0. (6.8.20) 
len 


We also enforce the continuity conditions 


> AA) = 0, 


>a rA(o) = 0. (6.8.21) 
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These continuity conditions, in turn, require the following conditions on the 
Fourier modes: 


3. © 
ya RO* ihe a = 0. 


a, 


es) 
1 | ) 

— AO (ROA RO + —_ 8, | IV 0. eee) 
LSE Jn mn\ a ) wD 9@A 

We now have enough conditions with which to solve for the U and V 
matrices. The calculation is arduous, but we finally find 


en 


Unn = = a 
m - 
Vi = —aa,03V2—N". (6.8.23) 
a, 


Next, we wish to construct the supersymmetry operators in the theory, build- 
ing on our experience with the generators of the free supersymmetric theory. 
Let the first quantized supersymmetric generators in (3.8.22) be denoted by q; 
then the second quantized supersymmetric generators Q are related to q at the 
free level by 


OQ, = | a da / D'°Z[Tr ®_.q®q + V_ag Vo]. (6.8.24) 
0 


Notice that the canonical quantization conditions (6.8.14) guarantee that if the 
q’s form a supersymmetric algebra, then the Q’s must also. In particular, the 
first quantized q’s obey 


qua j= .¢ V2.0. (6.8.25) 


Now we wish to construct the second quantized version of these relations. 
Specifically, the interacting part of the second quantized generator is given by 


3 3 
Or ta fT] da,D"2,8 (d>-1) 
r=! i=l 
wn (322) oes elo ‘). (6.8.26) 


—) 


If we now substitute the expression for the second quantized generators into 
the commutation relations, we have a series of terms that must sum to zero. 
To the zeroth order in the coupling constant, this is guaranteed because the q's 
satisfy the relations of supersymmetry. However, the terms that are linear in 
the coupling constant contain mixing between the free g's and the interacting 
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|Q°) terms. Specifically, we find 
3 


= = 3 - = 
Daa On Pe gat Omt 2 |H)34?, 


rll r=] 


3 
2a moors) )+(A ae B)= 


= 
ee ~A\Q-8) + (A B)=0. (6.8.27) 


To solve these equations, we postulate that the Q’s have the general form 
o~ ary), 
(Ora) Kees Py 2 yeyenvy ss (6.8.28) 
We have now assembled a formidable apparatus, mostly constructed by guess- 
work and analogies from the bosonic case. Next, we must make one more 
educated guess, and that is the structure of the functions Z and Y appear- 
ing in (6.8.16) and (6.8.28). These last two functions are stringenily restricted 
because they cannot destroy the Fourier matching conditions that we have con- 


structed. Thus, they must either commute or anticommute with the continuity 
conditions. It turns out that this uniquely specifies Z and Y to be 


= [20|-V? (a ea oft), 


rm a, 


Ke tol" (04+ J sone Des a a) (6.8.29) 


Finally, let us turn the crank. We must generate |H) by the commutation 
relations of supersymmetry. By brute force, we now find that 


V2 


i 


pay’ Ay By NA ZR ABCDyAyByCyD IV). 
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(6.8.30) 
Now that we have a complete expression for the vertex functions that satisfy 
supersymmetry, let us try to rewrite the Z and Y functions. We noted earlier 
that they were chosen such that they commuted or anticommuted with the 
continuity conditions. We suspect, therefore that they actually vanish except 
at the breaking point of three strings. Because functions can easily diverge at 
the breaking point, we must carefully take limits as we approach that point. 
By careful analysis, we can rewrite Z and Y as 


y4 = lim(5e)'7(64 (ra — 6) + 641 (ra1 — 6), 
Z= lim /2enpi (may — €). (6.8.31) 
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The crucial thing to note here is that the Y’s and Z’s are both local on the 
string; i.e., they occur only at the point at which the strings join. Thus, we are 
spared any nonlocalities that would spoil Lorentz invariance. 


6.9 Summary 


Historically, it was conjectured that a field theory of extended objects was not 
possible for several basic reasons: 


(1) Attempts by Yukawa, Heisenberg, and others found that whenever we 
deviated from locality in quantum mechanics, we violated causality, as 
vibrations at one point of a blob would propagate faster than the speed of 
light throughout the blob. 

(2) The theory would not reproduce the Veneziano model, because cyclic 
symmetry and many of the properties of the Beta function only hold on- 
shell, while an action by definition is off-shell. 

(3) The theory could not be both Lorentz invariant and unitary off-shell be- 
cause a quantization program for strings does not exist which is both 
Lorentz invariant and unitary off-shell. 

(4) Likewise, in string theory, a field theory was thought to be impossible 
because of overcounting introduced through duality. The dual diagrams 
already have sums over s- and f-channel poles, and hence adding dual 
diagrams with poles in each of the various channels would yield severe 
overcounting, especially at the higher loops. 


Fortunately, the field theory of strings evades all these problems. It solves the 
first problem of nonlocality by introducing a mu/tiloca/l theory. Thus, changes 
in the string topology only take place locally; i.e., strings can either break or 
reform only at a single point in the interior or at the endpoints. The vibrations 
from the break travel at velocities equal to or less than the speed of light 
(neglecting Coulomb effects in a light cone theory.) 

String field theory also solves the second problem because it actually breaks 
some of the properties of the Veneziano model. This is still acceptable, because 
the theory reproduces the on-shell Veneziano model. 

Third, the BRST quantization program is Lorentz invariant, but only at the 
price of adding ghosts. This breaks unitarity off-shell, but the final S-matrix is 
unitary, so no physical principles are violated. 

Lastly, string ficld theory solves the problem of duality by explicitly break- 
ing duality. Only the sum over all Feynman graphs yields the correct dual 
diagrams. Thus, at any intermediate stage of the calculation, duality is actually 
nonexistent. 

We have stressed throughout this book that the second quantized formalism 
allows us to derive the entire model from a single action. In particular, the 
light cone formalism gives us an interpretation of the Feynman series where 
the string picture is clear. In the light cone gauge, we use the simple restriction 
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that all interactions must be local, i.e., strings can either break or reform 
only at single points along the string or the endpoints. This uniquely fixes 
all interaction terms in the action. In particular, we get terms that involve 
four-string interactions. 

We begin by deriving the theory in the same way that Feynman derived 
the Schrédinger equation from classical mechanics. We use the fundamental 
relation 


Xi > — (Age 1 Ao, to) 
= hile | axe) 


X2 
= DXel fa Mode 
Xx, 
= | D?OO*(X)O(Xy)e! xr LAeL@)DX. (6.9.1) 
where 


Ho sat i doa(X? — X?), 
20 
L(®) = 0*(i9, — H)®, 


a => [ao wae 
bee? : m2} 


We can prove all these relations by simply inserting different sets of 
intermediate states between the initial and final states: 


first quantization: ix) | Dx (X|=1,. 


second quantization: |®) / D? de~*®) (@| = 1. (6.9.2) 

The field functional ® is not a function of o. It is a functional of a string 
variable X that is defined over a region of o: 

DM = (XO) = OX (0), X(e5), x (3), ..-., X(en)]. (6.93) 


The easiest way to decompose this function is in a basis state of all possible 
members of the Fock space: 


|) = >) dim I{n}). (6.9.4) 
{rn} 


Because ® satisfies the string Schrodinger equations, we can power expand 
the field functional in eigenfunctions of solutions to the string Schrodinger 
" equation: 


bp tnj(T, Xi) = / dpe Oe Ae 5:10}: (6.9.5) 
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Notice that A is the creation /annihilation operator for creating or destroying all 
possible excitations of a string. Thus, it corresponds to an infinite-component 
field theory. We impose the standard canonical commutation relations, which 
force us to choose 


[Ape pistnt> Ag+ gam) = OlP” = a) 0e(P: — geet pa- (6.9.6) 

Now we can power expand the field function ® in terms of these eigenfunctions. 
(X|) = ®,+(X) 

= fT 4er Apr min thn Xe = x; Glee), (6.0) 
(n)} i 


We can reproduce all the identities found in field theory. In particular, we can 
show that the Green’s function is the matrix element of two fields: 


Ain = ((0] ®F.(X1, 11) ®g+(X2, 72) |0)) 
= (pt — of Dxe'J 44 | 5(X(o, 11) — Xi()) 
x | [6(X(, 2) — X2(0)). (6.9.8) 


Interactions are uniquely determined by our rule that the local topology can 
only change locally. Thus, our three-string vertex function is given by a delta 
function: 


3 
53 = f aps (>> £°) { Dx m9" 90" OLN le: 
r= 


(6.9.9) 
Fortunately, it is possible to perform the integration over the strings, which are 
simple Gaussians. After doing the integration, we find 


Nese ee . 
aie E ye aN nee 
r.s=1] mn=l 
3 fore) 
+ eee Ke}, (6.9.10) 
P— len 
where 
Nin = (Co 'Ymndrs — 2mny (ATLA) (6.9.11) 
and 


Nv S(T PB). 
K =—!Br'B, (6.9.12) 


3 


ey a 


—| 


II 
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At this point, we must show that this vertex function can yield the usual 
Veneziano model. The simplest way to show this is to calculate the Neumann 
function for the three-string configuration in the first quantized formalism and 
then compare the results. Our starting point is Mandelstam’s mapping: 


p =a, In(z — 1)+ a2 Inz. 


Since we know the Green’s function in the upper half-plane, we just take 
Fourier components of the Neumann function. The Fourier coefficients look 
like 


1 2n 
Nin = Oo | an-an.e. = nc) — z(6;)) 


d er (Zr)— ng3(zs) 
= — as - z, [ ieee on (6.9.13) 


Now we simply insert the Neumann function over the upper half-plane into 
the above formula. The calculation is straightforward, and we get 


mn ee 
nS = ————___@ 003’ N*, 
nm ma + na, 1A2031V 40 7 
N’ =a! fin(—ot41/a,) Exp(mT9/a,), (6.9.14) 


where 


n 
K = —1/(2a)a2a3), (6.9.15) 


3 
a he, In |a,|, 
r—! 


P= ap) — 7). 


fale) = (ny) ear 


Now let us analyze the question of superstrings in the GS light cone formal- 
ism. We now have two sets of oscillators, fermionic and bosonic. Our basic 
strategy is to guess an ansatz for the vertex function and force it to obey the 
overlap continuity equations. Based on an analogy with the bosonic theory, we 
guess 


|V} = Z; exp] Ap + As] (0) s( De. af SS pt )a( yas), (6.9.16) 


r 


_ where 


ie?) 


és 5 ROG Re yy VE ROAGA, (6.9.17) 


rs= nile 


pal 
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In Chapter 3 we showed that the first quantized supersymmetric generators 
satisfied 


{q7-4,q78) = 2n8"8, 
lq qa ge (6.9.18) 


Now we must show that the second quantized versions of these supersymmetry 
generators also satisfy these relations. We take the ansatz 


ea 
[O> eet! °” yetiaieed ae (6.9.19) 


Remarkably, we can satisfy all conditions placed on the vertex function with 
the choice 


y7= lim(56)'°(44 (101 Saye AA(ray et 
zZi= lim J2exp! (ray —&). (6.9.20) 


It is important to notice about this vertex that the condition of locality is 
still enforced. The additional insertions with Y and Z take place only at the 
breaking point of the string, and hence locality is preserved. Thus, our entire 
treatment of the light cone theory is consistent with our original assumption 
of locality. 


References 


[1] M. Kaku and K. Kikkawa, Phys. Rev. D10, 1110, 1823 (1974). 

[2] M. Kaku, /ntroduction to the Field Theory of Strings, Lewes Superstring 
Workshop, World Scientific, Singapore, 1985. 

[3] M. Kaku, String field theory, /nternat. J. Mod. Phys. A2, | (1987). 

[4] P. West, Gauge-Covariant String Field Theory. CERN-TH-4660 86. June 1986. 

[5] T. Banks, Gauge Invariant Actions for String Models. SLAC-PUB-3996, 

[6] E. Cremmer and J. L. Gervais, Nucl. Phys. B76, 209 (1974): Nucl. Phys. B90. 
410(1975): see also M. Ademollo. E. Del Guidice, P. Di Veechia, and S. Fubini, 
Nuovo Cimento 19A, 181 (1974). 

[7] J. F. L. Hopkinson, R. W. Tucker, and P. A. Collins. Phys. Rev. D12, 1653 
(1975); 

[8] S. Mandelstam, Nucl. Phys. B64, 205 (1973); B69, 77 (1974). 

[9] M. Green, J. H. Schwarz, and L. Brink, Nuc/. Phys. B219, 437 (1983): B243. 
475 (1984). 

[10] O. Alvarez, Nucl. Phys. B216, 125 (1983). 

[11] H. P. McKean, Jr. and I. M. Singer, J. Differential Geometry 1,43 (1967). 

[12] S. Mandelstam, in Unified String Theories (edited by M. B. Green and D. 
Gross), World Scientific, Singapore, 1986. 

[13] M.B. Green and J. H. Schwarz Phys. Lert. 149B. 117 (1984): Nucl. Phys. B128, 
43 (1983); Nucl. Phys. B243, 475 (1984). 


CHAPTER 7 


BRST Field Theory 


7.1 Covariant String Field Theory 


The great advantage of the light cone field theory, as we saw, was that it was 
unitary, manifestly ghost-free, and could reproduce the string amplitudes from 
a single action. There was no need to appeal to intuition in order to construct 
the unitary S-matrix. 

The light cone theory, however, was still a broken theory. We would like a 
covariant description in which all the gauges of the string are in operation. The 
next step in the development of string field theory is the use of BRST tech- 
niques to write a covariant description of string fields. The power of the BRST 
formalism is that we can reformulate string field theory in a fully covariant 
way with the introduction of Faddeev—Popov ghosts. 

We will extract the BRST field theory in the same way that Feynman ex- 
tracted the Schrodinger equation from the classical first quantized theory. We 
will start with the first quantized theory quantized in the BRST formalism 
and then extract the field functionals. It must be stressed that the BRST field 
theory, like the light cone formalism, is still a gauge-fixed field theory. Be- 
cause we will extract the BRST theory from the gauge-fixed first quantized 
theory, we will find rather bizarre remnants of the first quantized theory in- 
truding into the second quantized field theory, such as Faddeev—Popov ghosts, 
ghost counting numbers, parametrization midpoints, and parametrization 
lengths. 

There is also some irony here. Originally, the light cone field theory was 
introduced to provide a coherent, comprehensive formalism in which to express 
the entire theory. Unfortunately, the attempts to covariantize the model have 
produced not one but two competing covariant BRST string field theories! 
These two BRST theories are based on entirely different string topologies, and 


222 7. BRST Field Theory 


there does not appear to be any link between them other than the fact that they 
both can successfully reproduce the Veneziano model. 

Previously, we saw that the Gupta—Bleuler formalism, instead of solving 
the constraints (as in the light cone formalism), applies the constraints directly 
onto the states 


10) — 0: nee 0: (ie 


Normally, we would expect that all 26 components of the theory propagate in 
the action. However, this is not true if we are careful to include the effect of 
the constraints being applied to the state vectors. The effect of the previous 
equation is to kill off the ghost states from the spectrum. 

There is yet another way to eliminate the ghost states of the theory, and that 
is to follow the analogy of gauge theory. Instead of applying these constraints 
on the Hilbert space, we will now demand that the action be invariant under a 
transformation of the field ® when it rotates into a ghost 


5|Dj— LE _, |An)- G1 


Notice that the state L_, | A) is a ghost state, as we saw in (2.9.3). We motivate 
this choice because of an analogy with electromagnetism, where we have the 
gauge symmetry 


SA, = OA. (7.1.3) 


The action for the Maxwell field is invariant when we rotate the field A,, by a 
ghost field 0,,A. Let us now prove that the string variation actually contains the 
gauge variation of both the Maxwell field and the linearized gravitational field. 
As we saw from (6.3.19), the field |®) is the sum over all possible excitations 
of the string. We power expand the fields into their components 


|b) = P(x) |0) + Aza", |0)+---, 
[A) =A) |0) +---, (7.1.4) 
Eo hy ay ee 


Inserting this power expansion into our gauge transformation of the string field 
(7.1.2), we find 


5A,a", |0) +--+» =9,Aa", |0) +---. 


Equating terms in the power expansion, we obtain (7.1.3). Thus, we have 
derived the gauge transformation of the Maxwell field by power expanding the 
field function [1-3]. 

Similarly, we can also prove that the closed string sector contains the 
linearized gravitational field. We now demand 


5|W) = So Lan |An) + Lon |An), (7.15) 


7.1 Covariant String Field Theory 293 
where the bar designates the second, independent Hilbert space. Power 
expanding this expression, we arrive at 


IY) = (2) 10) + yy”, 10) +o, 
|A) =A,a",|0) +--:, 
] 


JA) = Ayat, |0) +--+, | (7.1.6) 
(oe = Kae, AF 3 
beak 


Putting everything together, we obtain 
Ota. ja 0) +++: = 4 A,a_,4a") (0) + (2 @ v) +-+-. GAs 
Equating coefficients, we find 
OM Oy dpi: Op Ay. (7.1.8) 


Thus, we recover the original variation of the linearized graviton field. 

Next, we wish to find an action that is invariant under this gauge 
transformation (7.1.2). 

Let us now repeat the arguments made in the previous chapter concerning 
how to derive the string field theory from the first quantized formalism. The key 
step was inserting a complete set of intermediate states at every intermediate 
point between the two string states. The insertion of the intermediate states is 
now complicated by the fact that we must preserve the gauge constraints at 
each step of the calculation. Generalizing (6.3.29), we find that the new set of 
intermediate states is therefore 


= PIX) | DX (x/P, (7.1.9) 


where P is a projection operator which guarantees that ghost states are 
eliminated at each intermediate step of the calculation. It satisfies [4] 


L,P = PL_, =0, P? =P. 


Repeating the same steps used in the previous chapter to extract the Lagrangian 
from the path integral, we now find that the action must be 


L=@®P[Lo — 1]P®. (7.1.10) 


The crucial fact is that this action possesses the local gauge symmetry (7.1.2). 
This gauge symmetry is responsible for eliminating the ghosts that appear in 
the propagator, which in general would propagate all 26 modes if the projection 
operator did not exist. 

It will prove useful to power expand this action and then compare the results 
with the Maxwell and gravitational actions. To lowest order, we can write the 
projection operation in the actions as 


(| (Lo — $L,L_) +--+) |®). (7.1211) 
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Let us now power expand this in terms of the lowest-lying states. The expression 
now becomes 


(0| A,a} (Ol — kyayk,a®,)Aoa?, |0) , G12 
because L_| ~ k-a),|®) ~ A,a“, |0), and Lo ~ UJ. When we reduce this 
expression, it becomes 

A, (n’'O — 848")Ay (7.1.13) 


which, of course, is proportional to the Maxwell action. Thus, to lowest order, 
the action (7.1.10) contains Maxwell’s theory. Similarly, we can repeat all 
of our steps and show that the closed string action reproduces the linearized 
graviton action. At the next level, however, there are complications. 

By brute force, we can power expand our projection operator to the next 
levels and we find 


P(Lo — 1)P = Lo — 1 — $L-1L; 
+ L*,(4L9 + 4D —9)ALi 
+ L? ,(6L) + 6)AL, + h.c. 
~ L_3(4L9 + 2)(2L9 +2)AL2 + +++. (7.1.14) 


where 
A = 2(16L3 + (2D — 10)L) + D)"!. (FAAS) 


The operator A is nonlocal because it contains the inverse of a polynomial in 
p’. For example, we know that p~ is nonlocal because it equals the Feynman 
propagator in x-space: 


l 
A(x — y) = (xJ| = ly). 
( - C] . 
Since x and y are two distinct points, the expression p~* in x-space is nonlocal. 
Fortunately, there is a way to remove any nonlocality introduced by poly- 
nomials in p~? and that is to introduce more auxiliary fields. For example. 
consider the Lagrangian 


pAy + Bd, (7.1.16) 


where A and B are operators. By eliminating the y field (either by calcu- 
lating its equation of motion and reinserting it back into the Lagrangian, or 
by functionally integrating over y in the path integral), we find the reduced 
Lagrangian 


— i@BA™' Bo. GANT) 


Notice that if A is a polynomial in p~, then A~' is a nonlocal operator. The 
point here is that we can trade off nonlocal actions for auxiliary fields. Since 
these auxiliary fields decouple from the final action (because they are ghosts) 
they do not affect any of the physics. 
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Now we wish to construct an explicit form for the full projection operator P 
for all levels. We can construct P in several ways, such as a power expansion 
in the L, operators, or a projector onto each state of level N. We will explore 
the latter possibility. 

First, let us define the Verma module (see (4.1.42)) as the set of all possible 
raising operators L_, acting on the vacuum |0) (a; < a+): 


L_jq)|0) = 1%, L%,---L |0), (7.1.18) 


where {a} symbolically represents a vast collection of indices. Then let us 
power expand the projection operator in terms of L,,;. We assume that the 
projection operator P has the following form: 


P=1+) Lb _.Fap(Lo)Lp, (7.1.19) 
a.B 


where F,, are arbitrary functions of Lo, and we must remember that we use 
the symbols w and 6 to represent large collections of indices. If we demand 
that the operator P vanish when multiplied by the Virasoro generators, then 
this fixes the F’’s exactly. Let P* represent the ghost projection operator. Then 


(es ey oe 
co = P(N) 
Eee LL FLL, (7.1.20) 
ihe lela iai=N 


where we have explicitly written in the sum over different levels VN. By demand- 
ing that the operator vanish when multiplied by L,,, this gives us a recursion 
relation satisfied by the polynomial F. Let us define the F matrix as 


FN (Lo) = > [ap — Ads (Lo) (S~')p, (Lo), (7.1.21) 
B 


where the matrices § and A have yet to be determined. By forcing the operator 
to have the correct properties, this gives us a recursion relation where the A 
matrix can be determined iteratively: 


N-1_ p(M) 


Ua yE gh RPS E-3AN(Lo)IR). (7.1.22) 
M=1 a,p=!) 
la|=|BI=M 


Lastly, the matrix S is defined as 
Sug |R) = LaLb_g\|R), (7,123) 


where |R) is a real state and |a| is equal to }7"_, iA;. 

Notice that this expansion is iterative. The Nth level is determined in terms 
of all N and lower states. Thus, knowing the projection operator at the first few 
levels, which we have explicitly written out in (7.1.14), gives us the projection 
operator at all levels N. 
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This projection operator is nonlocal in p”. This is easily remedied as we can 
calculate the F’s exactly because we know the precise location of all zeros of 
the determinant. 

Fortunately, the determinant of the matrix elements of Verma modules is 
known exactly. We know that, at the Nth level, the determinant of the matrix 
element between (0| L, and L_, |0) is given by 


det | (0 | Oy Bes: 10) H = | [Zo = Pe) len ole 


P.q 
hy. = [(mq —(m + 1)py? — 1]/4m(m — 1),(7.1.24) 
6 
C= Rares) 


where p, g are positive integers and their product is less than or equal to N. 
This is a remarkable formula, first written down by Kac [5], which has many 
ramifications. For example, if the Kac determinant is nonzero, this means that 
the matrix S introduced in (7.1.23) is invertible. This means that the Verma 
module forms an irreducible representation of the conformal group. Thus, the 
Kac determinant is essential in determining the irreducible representations for 
the string model. 

The principal use of this result is to locate all zeros of the determinant and 
hence the location of all the poles of the projection operator. Inserting (7.1.19) 
into our action, we find 


oO p(M) 
[by 1-5 Ss Lagi Ls] (7.1.25) 


Meal “are A 
la|=|B|= 
where F is related to F by a multiplication by Lo. The advantage of this 
expression is that we know the location of all the singularities of the F matrix. 
Now let us repeat the analysis of (7.1.16) and (7.1.17), where we introduced 
auxiliary fields in order to soak up the nonlocal terms in our action. We first 
decompose the poles within F: 


Fxg(Lo) = )_ AgsIL —0—ati] " + BY, (7.1.26) 
i,m 


We can now introduce auxiliary fields p to soak up the extra nonlocal pieces 
[6] contained within F: 


N=leear6=! 
lal=|BI=N 
co p(N) 
ae > pile = aN! HT 
N= pea= | 
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as AL Lat 
+ Clee, ) i, (F027) 


The point of this was to show that the action can be written in a form that is 
explicitly local in p?. The price we paid for locality, however, was introduction 
of an infinite set of auxiliary fields p%’". Our next task is to find a simple way 
in which to compactly represent this formula. 

In order to rearrange the terms in the above action in an elegant fashion, 
we will need to use the power of symmetry. First, we will introduce the 
BRST formalism, in which these auxiliary fields can be reinterpreted as being 
“Faddeev—Popov ghost degrees of freedom.” We will see that the auxiliary 
fields p’” can be represented in terms of states defined on anticommuting 
ghost fields. 

Second, the fact that these auxiliary fields form a Verma module is the first 
indication that there is an underlying group-theoretic origin to the ghost fields. 
BRST ghost fields, by themselves, are strange objects in a second quantized 
field theory, but the presence of Verma modules shows that these ghost fields 
are really representations of a deeper symmetry. 
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The BRST formalism uses the Faddeev—Popov ghosts to express these auxiliary 
fields p’" introduced in (7.1.27). These Faddeev—Popov ghosts will kill 2 of 
the 26 modes, giving us the required 24 physical modes. 

Let us review the first quantized BRST string theory. In the Lagrangian 
(2.1.41), we choose the conformal gauge rewritten in the form A = 1; = 0. 
We can write the Faddeev—Popov determinant for the conformal gauge as 


he / de dn8[Ae, n) — 1]8[o(e, n)] = 1. 


We can in turn, explicitly calculate this determinant by introducing the 
Faddeev—Popov ghost field in 6 as in (1.6.22) 


Ape [ee dpo exp | —i | do dtL(@, P|. (72.1) 
where 


L(O; po) = Pa(8x + 97)0' + pe (r — 9,07. (7.2.2) 


Notice that this expression allows us to rewrite the original action of our 
theory to include the Faddeev—Popov ghosts: 


L(o, t) = PuX, — $[Pi + X72 + pod), (7.23) 


where p is the Pauli spin matrix 09. 
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As before, this Lagrangian has a local gauge invariance that can be generated 
on the fields by the application of a nilpotent operator Q [7]. The simplest way 
to see this is to make a two-dimensional Wick rotation and rewrite the action 
in a manifestly conformally invariant fashion in terms of the complex variable 
z and anticommuting fields b and c. Rewriting (7.2.2), we find, as in (2.4.4), 


=a = 
L = — (38,X,8:X, + bdzc + bd,c} . (7.2.4) 
1 


As before, we see that this is invariant under 
Ox —e1COLN COs 
oc =| co-cl; 
6c = ©[c0:c], Cay 
6b = e[cd-b + 20.cb — +0,X,0.X*], 
5b = e[€0:b + 29:cb — 40:X,,0:X"]. 
From this variation, we can extract out the nilpotent BRST operator Q. 
However, it is also important to note that, in general, given anv Lie algebra 


with commutation relations [Am.An] = f,?,Ap, it is possible to construct a 
nilpotent operator Q out of anticommuting operators c,, and b,, [8]: 


O=) lee eongel (7.2.6) 


For our case, we have two sets of Faddeev-Popov ghosts with 
{Cin> Om} = Sn.—m- (7.2.7) 
Thus, our nilpotent BRST operator can be written as 


CO 
O= > ae, a7) 


n=—Oo 


| fo 2) 
25 ys, C_mC2n0pen i oe) 


n,m=CoO 
where 


eee == ee (m + n):Dy—mCmis 


ANESS 


Q* =0, (7.2.8) 


where the last identity fixes the dimension of space-time to be 26 and the 
intercept to be equal to 1. 
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Following the analogy with the point particle case, we can make the tran- 
sition to the second quantized formalism by taking time slices. This time, 
however, the basis states |X) are replaced by the complete set of intermediate 
states | X, 6) because of the presence of the Faddeev—Popov ghosts. This means 
that, when we insert an infinite number of intermediate states into our path in- 
tegral, we must also insert a complete set of intermediate states corresponding 
to the Faddeev—Popov ghost. Because the 6 variable occurs in the path integral 
with a tT derivative, we have no choice but to include it in the complete set of 
intermediate states when we factorize the functional integral. Thus, our field 
functional © becomes a function of these ghosts fields. Equations (6.3.29) and 
(7.1.9) are replaced by 


ye 1) x) | DxDe (| (XI, 
(6| (X|®) = ©(X, 6). (7.2.9) 


Our Green’s functions in (6.3.34) is now modified to read 
Ves 


B(A 3 6,; Dee) = | 
50) 


= | D’DO(X;, 6;)P*(X;, 8;) 


DX DP D@ DP, exp ji f 40 dtL(o, n| 


X,6; 
maexD [ / pxDo1()]. (7.2.10) 
Xi6; 
The basic field functional is now a function of the Faddeev—Popov ghost fields 
0. By making the transition from |X) |@), we find 


DSi leg = 1): . (7.2.11) 


where Lo is now a function of both the X and @ variables. This action was 
first written down by Siegel [9-14]. If we power expand the BRST field |®) 
in terms of its ghost modes, we find 


ID(X)) = D> bipg(Xb-pHe-g} I=). 
: {P}{q} 

This action explicitly propagates not only the 26 degrees of freedom of the 
string but also two ghost modes, yielding a theory with 24 physical modes. 
Notice that we are summing over all possible ghost numbers in this expression. 

Notice that the above action is totally gauge-fixed. It is also possible, how- 
ever, to write another action that contains an explicit gauge degree of freedom 
based on the nilpotency of the operator Q. Let us now replace (7.1.2) with 
a new gauge transformation. Before, we remarked that the origin of (7.1.2) 
comes from rotating a string field into a ghost field. In the BRST formalism, 
however, the physical conditions L, |phy) = 0 are replaced by Q |phy) = 0. 
Thus, we suspect that the new BRST gauge invariance is given by 


6|W) = Q|A). (72212) 
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The state (A| Q does not couple to physical states |phy), so this confirms our 
conjecture. Because Q is already nilpotent, our choice for the new action is 
[15-24] 


L=(b| OW), C213} 
where we take only the “ghost number —i truncation” of the original field: 
[Y) = Pip|Y), 


where P is the projection operator that extracts only ghost number — i Notice 
that the new field |W) is a subset or truncation of the original field |®). 

If we decompose this action, we find that the lowest excitation of |W) obeys 
precisely (7.1.10), so this is the desired generalization of our action, where the 
auxiliary fields that we found there are now regrouped according to the mode 
expansion found in |W). In summary, we have now compactly reformulated 
(EZ as ie2 1a) 

The equation of motion corresponding to this action is 


Q|v) =0, (7.2.14) 


which reduces to the usual constraints L,, |¢) = 0 for the ground state. 


7.3 Gauge Fixing 


It can also be shown that the previous action (7.2.13) can be gauge-fixed to 
obtain either the action (7.2.11) or the light cone action (6.3.33). To fix the 
gauge, it is sometimes helpful to decompose the operator Q according to its 
zero modes 


Q=cK —2bR+d+6, 73) 
where 


K= Lo —1 + So (near =e nb_,b,), 
n=} 


ree el: G22 
f=) 


d = Con(Ln + flmnd-pOm + 3C-mSpnd—p)s 
6 8G) (Ly PD 5 {be p. coatieetalt). 


where the f’s are the structure constants of the Virasoro algebra. Be- 
cause Q is nilpotent, these operators must satisfy a large number of simple 
identities: 
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[R,d] =[R, 8] =0, (7183) 
l= ORK 


The process of reducing this action down to the usual one begins by writing 
down the ghost vacuum of the theory. 
In Chapter 4 we introduced the “ghost counting number” operator (4.4.20): 


lo, @) 

> (C-nbn + b-nen) + $1¢0, bo]. (7.3.4) 

n=1 
This operator counts the number of c modes minus the number of b modes. 
We can use it to label eigenstates. Because both of the b, c ghosts have zero 
modes, the vacuum of the Faddeev—Popov ghost fields is quite different from 
the usual unique vacuum of a’. The ghost vacuum (as we saw in Chapter 4) is 
actually degenerate and can have ghost number either + or —!. We can define 
two vacua: 


co |+) = 0, By |—) = 0, 
[-) = Do l=). (785) 


There are thus two ghost vacua. Let us fix the ghost number of WV to be —5 
and decompose it in terms of the two vacua: 


Iv) = wl-)+¢l+). (7.3.6) 
Then the equation for action (7.2.13) becomes (using (7.3.1)): 
(Wl Kw) + (Wide) + (8blv) + (1 RO). (7.3.7) 
If we write 
|A) =A|-) + @|+), (7.3.8) 
then the variation of the field becomes 
6v =(d +6)A — 2K, G39) 
6@ = —KA+(d+ 38)w. (7.3.10) 


In general, the relations that we have written above can be reexpressed in 
component language. We can always decompose a field functional into its 
ghost modes: 

[o, ¢) 
|W) = Sy C—n,C—ny ** SCL On Der ta Donn oe nwa ear mau |0) . 


{n},{m} 
Geri) 


Notice that the indices are antisymmetric with each other because of the 
anticommutation relations between the ghosts. This allows us to write 


N 
wea ( yy) t (7.3.12) 
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where Cy) represents the product of N antisymmetrized and M antisym- 
metrized c and b fields. Notice that this allows us to introduce “forms” 
defined on anticommuting variables in the same way that differential forms 
are introduced via anticommuting dx” [10]. 

We now have the apparatus necessary to perform gauge fixing. Let us first 
choose the covariant gauge [9]: 


bo |W) = 0, (7:3. 135 


which eliminates half of the fields in |W), that is, 6 = O in (7.3.6). Now 
calculate the Faddeev—Popov determinant, which arises from the variation of 
boy |W) = 0. We find 


Lyp = (Al|boQ |A) 
= (A|K |A), (7.3.14) 


where (A| is a two-form. The strange feature of this ghost action is that it, in 
turn, possesses its own gauge invariance: 


O)A) = OVay)- (7.3513) 


Thus, the ghost action itself requires yet another Faddeev—Popov determinant 
(or else the functional integration over the ghost fields is infinite). The next 
ghost term coming from the Faddeev—Popov determinant of the ghost field is 


Lrp = (A2|K | Ai). (7.3.16) 


But this action, in turn, also contains is own gauge symmetry, which then 
requires its own Faddeev—Popov gauge term, and so on. It is clear that there is an 
infinite tower of “ghosts of ghosts” {10, 12]. This is unavoidable. because each 
Faddeev—Popov determinant is necessary to eliminate the infinities introduced 
by the previous set of ghost fields. 

Fortunately, it is possible to sum the series. If we start with our action 
(7.2.13), which has only ghost number —4, and then add to it this infinite 
tower of ghost actions, we simply retrieve the action (7.2.11), which is the sum 
over all possible ghost numbers. Thus, (7.2.11) is the gauge-fixed version of 
(7.2.13). In other words, a string functional with ghost number —+ has been 
transformed, with the addition of the ghosts of ghosts, into a string functional 
of arbitrary ghost number. 

Similarly, it is also possible to use the gauge degrees of freedom within 
(7.2.13) to reach the light cone gauge [25]. The problem, however, is that 
the equations of motion must be used explicitly for certain fields to eliminate 
others. Thus, only on-shell can the BRST formalism reach the light cone gauge. 
which is a limitation of the formalism. In the geometric formalism, however, we 
have enough gauge symmetry to eliminate the redundant longitudinal modes 
in the action in order to obtain the light cone field theory off-shell. 

The transition to the light cone gauge is made by observing that the unwanted 
states, which must be gauged away, can be arranged as the product of all states 
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of the form 
(aeae pence 0 (317) 


The level number of such a state is given by the sum of the level numbers of 
each collection of states: 


ng tn_tnptn.=QN. (7.3.18) 
It can be shown that these unwanted states can be eliminated through a 
variety of mechanisms. For example, for the lowest statesn, = n_ = 0,1 we 


find: 


(a) Most states withn. = n,;n_ > n, can be eliminated through 6W = QA. 

(b) States with n, = 0,n, = 1;n_ <n, are Lagrange multipliers. 

(c) States not included in (a) with n, = 1, n, = 1; n > nx are eliminated by 
these Lagrange multipliers. 


This generalizes to all unwanted states. In general, all these unwanted states 
can be eliminated by using the gauge invariance of the theory or by using 
Lagrange multipliers. Only the physical transverse DDF states are left after 
this elimination. 

In summary, we have shown that the action (7.2.13) can be gauge-fixed to 
yield either the original BRST action (7.2.11) or the light cone action of the 
previous chapter. We can also show (by eliminating out all higher excitations 
within |)) that the action reduces back to (7.1.10). 


7.4 Interactions 


In contrast to the light cone field theory, where strings simply split in their inte- 
riors, the interacting BRST formalism of Witten [16] relies on the configuration 
shown in Fig. 7.1. (At first, this configuration seems to violate momentum 
conservation. However, only in the light cone gauge is the momentum of a 
string linked to its length. In the covariant approach, the parametrization length 
is not related to the momentum, so this diagram is allowed.) The length of all 
strings is set equal. Once again, we can generalize the delta functions appearing 
in (6.4.4) to include this new configuration: 


se / DX 130(X1)®(X3)®(X3)5 123 


= i DX 03 (©1| (®p| (311) 1X2) 1X3) 82a, (7.4.1) 


where 


ie) 


3 
w= [] [[txX@- XAG -o)). (7.4.2) 


O0<o0 <n/2 r=l 
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3 2 


jon 


FIGURE 7.1. Symmetric interaction of BRST string field theory. The surface swept 
out using this vertex is not flat, as in light cone field theory. There is no need for a 
four-string interaction. 


As in (6.5.2), we must find the conformal map that takes us from the upper 
half-plane to the configuration being studied in order to construct the vertex in 
oscillator form [26-32]. Unfortunately, a conformal map of three charges into 
the upper half-plane (without cuts) does not exist for the BRST vertex because 
the sum of the changes is usually taken to be zero, while here the sum of the 
three charges for the symmetric configuration must add up to 3. The solution 
is to write electrostatics with six charges adding up to zero charge, and then 
identify boundaries to simulate the presence of three charges. This requires 
cutting and pasting together several regions of the complex plane. The map is 


FIGURE 7.2. Conformal surface of BRST string field theory. This conformal surface 
can be represented either by taking six charges and resewing the diagram to obtain 
a surface with three charges or by placing a Riemann cut on the surface. 


7.4 Interactions 305 


(see Fig. 7.2) 


+3 fous 
p= Ee in(z — z;) = In aa) Si. (7.4.3) 
where 
Ch SCh ]=0s = IL 

Qj =a. — a; = —1, 
zy = e'7/6 ee (7.4.4) 

Sane Z-1 = —Z}, 

(69) ar GF Lay — 23 


Fortunately, this map can be inverted, so we can solve for z in terms of ¢, which 
makes it possible to explicitly construct the Neumann functions. Inverting the 
map, we find 


ee 8 
2746) ms (7.4.5) 
1 — ies 
where 
P= ba = Ea +ina (7.4.6) 
fora = 1, 2,3 and 
p = —(q — im) vs) 
for a = —1, —2, —3. The Fourier coefficients will occur in the combina- 


tions 


fesie \ 
(ee) aS anti Ane 


ate. n=2k n=2k+1 
esa” 

(55) = eae Basie (7.4.8) 
Beene n=2k n=2k+1 


Putting everything together, we find that the Neumann function appearing in 
the vertex is 


Coed 
Nan =—3(C+U+U)1—Z(C—7U—zU)| 1 0 | 
eae 
0 1-1 
—tivaQU-U)} -1 0 1], (7.4.9) 


| 0 
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where 


Oe = 2(—1)"/ndnm: 


7 AnBm at Divas An BR = Bi Am 
9 (1) | ee ae 
(U + Dam = —2( Tal a Ce 
= An Bm — BrAm An Bn - BnAn 
(U — (am = —2i | 2 ——** +. = |. 
n+m =a) 


Written explicitly in terms of operators, this equals 


|Vi23)x =a oo Nee gat Py Nee —m a) dng rs | 10123) . 


r=1 
(7.4.11) 
We can, by using the continuity conditions for the ghosts, also calculate the 
ghost contribution to the vertex function. We simply replace X in (7.4.2) by 
the b and c ghosts. The ghost vertex is 


|Vi23)en = exp papa” °.] ae (7.4.12) 


Orit 


where the vacuum |+ + +) is the product of three vacua introduced in (7.3.5). 
For s =r ors =r+2, the X matrix is given by 


X's = —m(NiS — NTS*) (7.4.13) 


nim 


and fors =r+1 
XS =p NO = Ne), (7.4.14) 


nim 


The final vertex is the product of these two vertices, which are defined in two 
entirely different spaces 


|Vi23) = |Vi23) x |Vi23) oh - 


By a lengthy calculation, it is possible to show that this vertex satisfies the 
BRST invariance [29]: 


DPOte Os O07) | Visi 0: (7.4.15) 


There are several different ways in which to write this vertex. First, we know 
from conformal field theory that we can bosonize the anticommuting b. ¢ ghost 
system in terms of a scalar field, which we will call ¢. Then the new ghost 
vertex can be written as 


3 
= | | [Px D¢,enees 
rt 


x [] Xi@)— Xi" — 0))8(@-(0) - b-41(7 — 0). (7.4.16) 


O<a<(1/2)n 
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(What is somewhat surprising about this bosonized vertex is the presence of 
the midpoint peeesien, This term does not spoil locality in 0 because : ap- 
pears only at 4 57. Furthermore, it provides the correct ghost number 2 5 for 
the multiplication symbol *. Because the gauge Pehle! A has ghost num- 
ber —;, the symbol * must have ghost number 3 5 so that the product of two 
gauge Tananaien A,2 yields a third gauge = naee A; with the same ghost 
number.) 

There is yet another way in which to calculate the symmetric vertex, which 
depends on using conformal maps with Riemann cuts (rather than splicing 
different regions of the complex plane together). Consider the conformal map 


te al 2 3/2 
A ere a a) in (7.4.17) 
a1 = 2) 

This map has the usual singularities at z = 0, 1, and 00 (which correspond to 
the three outgoing and ingoing strings at +00). However, what is new is that 
this conformal map has an explicit Riemann cut that creates a multisheeted p 
plane. This cut is precisely what is necessary to create the surface in the p- 
plane that pies the symmetric collision of Cs ees The cut extends 
vertically from + — }./3i in the lower half-plane to 4 5 |_| 1./3i in the upper half- 
plane. We can tidersand the multisheeted p-plane a tracing our movement 
along the real z-axis in Fig. 7.3. 

If we begin at z = +00 on the positive real z-axis and move left, we can 
choose the constant k such that we trace out a similar line that moves along the 
positive real -axis to negative real infinity. When we reach the point z = | (at 
point B), we reach negative infinity on the real axis. By hopping over z = 1, 


U H|G F\D CB A 


FIGURE 7.3; 
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we suddenly oe zr units in the vertical direction in the p-plane. Moving from 
z= | tome of we move horizontally right in the p-plane from C to D until 
we hit the y-axis at p = iz. ut we now move vertically up the Riemann cut 
until we hit the pore EZ 5 + LiVv3 , then we move down the y-axis in 
the p-plane to p = 51. Going back down the cut to the li a = i means 
that we go from p = in to F, the origin. Moving from z = 5; og = it 
means moving from p = 0 to negative real infinity (on the ee iui 
sheet). Hopping over z = 0 means moving vertically up by iz at negative 
real infinity. Finally, moving from z = 0 to z = —oo means moving from 
p=in—otop =inz + oo (from H to /). 

It is also possible to extract the Neumann functions directly from the map 
(7.4.17). This alternative approach to the vertex function is presented in [30— 
eZ 


7.5 Witten’s String Field Theory 


If the parametrization midpoint of the string is singled out as a special point, 
then it becomes possible to define a closed algebra among string field func- 
tionals. A string represented by |A) can be joined with a string |B) such that 
their midpoints exactly coincide. By contracting over their oscillator states, we 
are left with yet another string of equal length, so we have defined a process 
similar to multiplication [16]. For example, the abstract notation 


means concretely 
(Ay| (Bo|Vi23) = ICs), (7352) 


where we have contracted over the first and second harmonic oscillators and are 
left with a string defined over the third harmenic oscillators. Because strings 
all have equal parametrization lengths, the product rule closes if the parametri- 
zation midpoint is singled out as a special point; i.e., the product of two strings 
of length one is equal to another string of length one. This, in turn, allows us 
to define a gauge transformation of the field functionals as 


bA=QA+A*x A—AXA. (7.3.3) 
If we define a “curvature” as 
F=QA+A#A, (7.5.4) 
then we find that 


SF =F*eA—AxF. (7.5.5) 
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Thus, if we can now define an operation called “integration” that preserves 


faxB=Cv" [axa (7.5.6) 


(where we use the anticommutation sign for Grassmann odd forms), then we 
find two invariants, a surface term, and the action itself, 


‘| F * F = surface term, 


L=AxQA+2AxAXA. (7.5.7) 


The remarkable nature of this approach is that we can reduce the essential 
features of the theory to five “axioms” as in the case of gauge theory: 


(1) Existence of a nilpotent derivation 


Q’ =0. (7.5.8) 
(2) Associativity of the « product 
[Ae Bx C= Ax B eC]: (7.5.9) 
(3) Leibnitz rule 
Q[A* B] = QA*B+(—1)4A x QB. (7.5.10) 
(4) The product rule 
fase=cay™ faxa, Ces i1) 
(5) The integration rule 
(h OA 0: (7.5.12) 


where (—1)* is —1 if A is Grassmann odd and +1 if it is Grassmann even. 


We define the integration operation as follows: 


fe = 1) DX [|] 8(X@)-X@-0))%(X) 
= (1| 


0<o <(1/2)z 
0), (7.5.13) 


This operation / is a rather strange object. It means that we take a functional 
of a string X, find its midpoint, and then integrate such that we fold the string 
back on itself around its midpoint. Thus, a functional of a string maps into a 
c-number under this identity operation. 

We can also give an explicit representation of this identity operator /: 


= |] 8G), 


Neuss 
TI) = p WALI ant |O). (7.5.14) 
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In addition, the vertex function is invariant under a subgroup of the conformal 
transformations, i.e., those that do not affect the location of the midpoint. Let 
us define 


Ko=iL, —(—1) Les. C3.15) 
This operator acts like a derivative on products of forms 
K,(A* B)=K,A*B+A*xK,B. (7.5.16) 


The integration satisfies the following relation: 
[ A= (AIK, |) =0 (5g) 


while the vertex function satisfies 
3 
SRV = 9. (7.5.18) 
il 
The advantage of this cohomological approach is that it adheres closely to the 
cohomological formulation of gauge theory in terms of forms. For Maxwell's 
equations, we have 


4 


d =diaa. a2: 
A= Gx At (PSelg) 
We can write this in compact form: 
fF =dA A a. 
d6bA=dA+AxA—A#A, (7:53:20) 


OF a FicwA — Neer. 


From these equations, we can write invariants 
[para f dtaxdatzawas a (525 


The strength of the BRST approach is that we can write the five axioms 
and the definitions of curvature forms such that the action is easily shown to 
be gauge invariant. These elegant axioms sum up a tremendous amount of 
information, capsuling the string model in five statements. 

The disadvantage of the BRST approach, however, is that the origin of these 
axioms is obscure. These five axioms can, in principle, apply equally well to 
any cohomological system. We have no understanding. therefore, of where 
these five axioms came from. For example, general relativity theory can be 
derived, as we have seen, from two principles. These principles, in turn, can be 
reformulated into the five axioms of cohomology. The five axioms, therefore, 
are not fundamental, but only a compact and convenient method of writing the 
tensor calculus. We seek, therefore, the underlying geometric principles that 
will allow us to derive these five axioms from first principles. The five axioms 
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(i.e., the tensor calculus) should arise naturally from a new infinite-dimensional 
group. 


7.6 Proof of Equivalence 


Although this is not obvious, this BRST vertex |Vj23), when the external par- 
ticles are placed on-shell, is equivalent to any other on-shell vertex function 
that can be reached by a conformal transformation generated by L,,. For ex- 
ample, let us define another vertex function | Vis) that can be expressed as a 
conformal transformation of the original vertex: 


|Vi23) = [Joo bs a |V123) . (7.6.1) 


n>0 


By acting on this equation with a physical state (phy| that satisfies (phy| L_, = 
0 we are left with 


(phy|Vi23) = (phy| V 123). (7.6.2) 


Thus, the two vertex functions have the same on-shell matrix elements. In 
this way, it has been shown that the vertex function shown here is equivalent 
on-shell to the covariant oscillator version of the Caneschi~Schwimmer— 
Veneziano [33, 34] vertex found in the early days of the dual model. (This 
vertex function was actually derived by taking N-point trees and then factoriz- 
ing the amplitude three times to extract out the three-Reggeon vertex function.) 
This shows that we must obtain the same S-matrix elements by using either 
vertex. This proves that the three-string vertex found in the BRST formalism 
yields precisely the same on-shell matrix elements as the old CSV vertex [29, 
30], even though the geometry of the BRST vertex is completely different. 
In fact, we can use this fact to show that all three vertices (the old light cone 
vertex, the old covariant vertex, and the new BRST vertex) all yield the same 
S-matrix elements. 

The proof that the vertex function gives rise to the correct on-shell vertex 
function of the Veneziano model is quite simple [30—32, 35, 36]. We will take 
matrix elements of both vertex functions with coherent states that are functions 
of a variable z and then show that the resulting expressions are related to 
each other by a conformal transformation. We then show that the conformal 
transformation can be expressed as (7.6.1). 

The old CSV vertex is 


ae 


[Vesv) = Yow (ss i allan ‘se ae» Co"). (7.6.3) 


n.m=| 
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where the C matrix satisfies the following property: 
co m ae 
pe Coo = Gd=zr-1 (7.6.4) 


(As a check, we can always hit one of the Reggeon legs with the vacuum, 
convert a three-oscillator Hilbert space into one oscillator, and reproduce the 
usual vertex function for the tachyon. Thus, (0;|Vcsyv) is equal to the usual 
Veneziano vertex function.) 

We will show that this vertex function, up to a transformation generated by 
the L_,, is equal to the symmetric vertex. To show this, let us first analyze the 
action of a conformal transformation ona coherent state (which was introduced 
in (2.6.18)): 


[z, k) = edn bet@n” 19), (7.6.5) 


By a direct calculation, we can show that the Virasoro operator acting on this 
state yields 


L, |z,k) =[(n + Dk22" + 2"*"'d.] Jz, k). (7.6.6) 


This is the transformation of a coherent state when one L,, is applied. Now we 
wish to apply an arbitrary number of the L,,’s to the coherent state: 


oo d 
Noakes Lala les fee rE | [ze (7.6.7) 
n=0 “ 
where we define function f(z) as 
fCy=e ya... (7.6.8) 
n=0 


Thus, a general conformal transformation generates the following transforma- 
tion on a coherent state: 


—\ k? 
~ ‘alee 2 
ern=o%bn |e x) — (=) Iz, k), (7.6.9) 
where 
z = g'(e(z) + 1) (7.6.10) 
and 
dg : 
ai 1(z). (7.6.11) 


Equation (7.6.9) should be compared with (4.1.8), which gives the transfor- 
mation of a field of weight / under the conformal group. 

Armed with these results, we can now show the equivalence between the two 
on-shell vertex functions. We begin by contracting the three-Reggeon vertex 
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function on three arbitrary coherent states [32]: 


3 


3 
[ [«. zilVcsv) = gle 


I i=l 


kjkixt 


|i —z, |", (7.6.12) 


i “FE Zi+l 


This can be calculated explicitly from the definition of the C matrices 
in (7.6.4). 

The next step is to contract the symmetric BRST vertex (7.4.11) with three 
arbitrary coherent states and compare it with the expression given above. We 
will need the following formulas: 


12 nm 
y eo log 
nim 


Ze Nezies log 


AUS) 1—2(@) ’ 
1 _ 2(z3) — 1 | 
Az) 2(z3) 


7 
+ log|z3|} +logi—|__, (7.6.13) 
dz = 
See, di 
Nam 22" = fee eee | 
dz dz 
Twila = log |—— = | — tog || an 


nim 


These identities allow us to construct an explicit form for the matrix element 
between the coherent state ram (k;, z;| and the symmetric BRST vertex func- 
tion. When we compare this expression to (7.6.12), we find the relationship 
between the old CSV vertex and the newer symmetric BRST vertex: 


3 3 aan 
J] i zVesv) = []eeo. kv) [| (2) (76.14) 
Zl H 


rl f= 


This is the expression that we want. It shows that the matrix element of the 
CSV vertex and the matrix element of the symmetric BRST vertex are re- 
lated by a factor raised to the k? power. Now compare this with (7.6.9), 
which shows that this factor can be reexpressed in terms of the L,,’s. In 
summary, we now have an explicit solution for (7.6.1) (at least on coherent 
states), which demonstrates that the CSV vertex and the BRST vertex are re- 
lated by a conformal transformation and hence have the same on-shell matrix 
_ elements. 

There is yet another way to show the equivalence of the CSV vertex with 
the symmetric vertex, which is based on a power expansion of (7.4.17), rather 
than on using the conformal properties of complex functions. We know that 
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L,, acts like z"*+'d. on a complex function. Let us define 

8 + DG—DE—s)—(e —2 
ZA erg) 

= 1thzet+ beter. 


Wo) 


It is always possible to find a series of coefficients a, such that 
Vi ere 


In practice, solving for these coefficients might be difficult, but in principle 
these coefficients are known to any degree of accuracy. For example, a) = 5 


and a) = =. Then it can be shown that the symmetric vertex and the CSV 


vertex are related by the following conformal transformation: 


3 r =s 3 oc r 
(V| e In(3V3)/4 Yo7_) (L§—-1) = (Vesy| ée Da Dae a, Ly, x 


Although we have now established the link between the symmetric vertex 
of Witten and the CSV vertex, we are still facing the problem that there are 
actually nvo BRST string field theories that do not seem to have any relationship 
to each other. The other BRST formulation of the interacting theory is based 
on the old light cone vertex, where strings simply split or rejoin at an interior 
point. We can also postulate that the three-string vertex function consists of 
the covariant Dirac delta function [18-24]: 


3 
via | DX 123 |X) |X2) 1X3) [[¢ (>: r4cr)8aon ; (Feel 5) 
oO r=1 


It is important to realize that this new vertex function is precisely the same as 
the old light cone vertex function (6.4.5), except that the harmonic oscillators 
are now fully covariant, rather than just transverse, and that we must also 
multiply it with the ghost vertex. When we multiply these two, we have the 
“covariantized light cone” vertex. 

We can check that the covariantized light cone vertex satisfies BRST 
invariance: 


3 
DO 0: (7.6.16) 


f= 


It is also possible to prove that this covariantized version of the light cone 
vertex is equivalent to the on-shell CSV three-Reggeon vertex function. We 
will use a different proot, however, from the one used before. This time, we 
will show that the continuity or overlap conditions satisfied by the CSV vertex 
function can be written after a conformal transformation as the continuity 
condition for the light cone vertex function. Since the continuity or overlap 
conditions define the vertex, it must follow that the two vertices are related by 
a conformal transformation. 
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Let us write the CSV vertex function as 


3 0° 
|Vesv) = exp (» oS aa) 0) , (7.6.17) 


ple 


where 
C= (-lI Tope 1 4mP- a fn — mi). (7.6.18) 


We begin with the assumption that we can impose the following continuity 
condition on this vertex function 


(A(z) PP [C. — z)'] + BQ) PL — 271] + CQ) PH [z]) |Vesv) = 0 (7.6.19) 


for some unknown values of A, B, and C and where P is given by 
oO 
ae) ae (7.6.20) 


By commuting the various harmonic oscillators through |Vcsy), we find the 
following constraints on A, B, and C: 

A C 

——+B-——  =0. (7.6.21) 

Z lz 
This still gives us wide latitude in choosing the values of A, B, and C. By 
carefully looking at where these functions converge, we find that we can set 
C = 0 when |z| > 1, B = 0 when |z — 1| < |z|, and A = 0 when |1 — | > 1. 
For convenience, we will select B = 0, which still leaves a certain degree of 
arbitrariness. Specifically, we choose 


1 — poz 
Sw. (7.6.22) 
1-—z 
- SS , 
2 


for some 7». 

Our goal is to show the relationship between this vertex and the usual 
light cone formalism, which maps the upper half-plane into the light cone 
configuration through 


p =a, In(z — 1) + a Inz. (7.6.23) 


As before, in the p-plane we parametrize the three-string vertex function in 
_ the o direction by the parameter 7,, where 


: | —Bon2 + Box, 0 < 73 < fox, 
33> 


7.6.24 
=pini a, four <3 <7, ( ) 
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and 
a} 
By = : 
|or3| 
po = = (7.6.25) 
a3 
r+) 
==; 
a, 
6, =el", 


Given these definitions, we can rewrite the A and C coefficients as 


6; a ee 
A= l—y)), 
d-7 a ™ 
63 dz 
C=-—. 7.6.26 
“o (7.6.26) 


Now comes the key step. We know that P transforms under the conformal 
group as follows (see 4.1.7), where we have taken the exponential of the usual 
variables: 


z dp d 
Ee. (< +) Pulp). (76:27) 
Thus, we can reexpress the continuity equation as 
65 d(l-zy',, 6&3 dz ') 
———__—_ ——_——- P, + ——P, }|V) =0. 7.6.28) 
laos pr dime oO 


Using the transformation properties of P under the conformal group. we now 
have 


(P (81) + P;'(83)) |V) = 0. (7.6.29) 


But 6, = e'”". This is the expression we desire. By a conformal transformation. 
we have created a new vertex that satisfies a different continuity condition on 
n,, which is the o coordinate of three strings. Thus, this new vertex function 
satisfies precisely the original continuity conditions of the light cone vertex 
function between strings | and 3. Similarly, by taking different combinations 
we can also show that the continuity conditions between strings 2 and 3 are 
also satisfied. Thus, we have shown that, by a conformal transformation, we 
can reexpress the continuity equation for the covariant Veneziano vertex in 
terms of the continuity equation in the covariantized light cone configuration. 

But since conformal transformations are generated by the L,, operators, this 
means that, on-shell, the covariantized light cone vertex function (7.6.15) is 
the same as the covariant Veneziano vertex (7.6.17). 

At this point, we must explain why there should be two versions of the BRST 
formalism, one based on the fully symmetric vertex and the other based on the 
light cone vertex, such that they both have the same on-shell properties. They do 
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not seem to resemble each other in crucial details. The covariantized light cone 
formalism, for example, has an additional four-string interaction term. This is 
required because the gauge group for the covariantized light cone theory does 
not close properly. If we try to adapt (7.5.3) for the light cone configuration, we 
have problems. The group doesn’t close without the addition of the four-string 
interaction, and even then it only closes on-shell. 

The symmetric theory, however, can be shown to be consistent without a 
four-string interaction. Thus, these two theories differ in fundamental ways. 

Although not all the details are clear, it seems that the resolution of the 
puzzle is that the second field theory, based on the covariantized light cone, 
is actually an incomplete theory. This is because we must integrate over all 
possible “lengths” of the string w,. In the old light cone theory, this integration 
was allowed because a, was really equal to an integration over the momentum 
of the string. However, in the covariant theory, a, is a redundant parameter, 
with no physical meaning. Thus, the integration over a, produces an infinite 
number of copies of the same thing. Thus, for example, in the zero slope limit 
this theory produces an infinite number of Yang—Mills actions: 


] lo) 
a | daF,,(ay +++, (7.6.30) 
0 


which is clearly nonsense. This infinite number of copies of the same thing, due 
to the integration over the fictitious parameter a, yields an infinite redundancy 
in our field theory. Thus, there is a fundamental problem with the second BRST 
approach. 

The resolution of this problem is that w must be expressed as a genuine 
gauge parameter and hence can be gauge-fixed. However, within the present 
BRST formalism it is impossible to make @ into a genuine gauge parameter. 
We must vastly increase the number of fields in the theory in order to gauge 
the “length” of the string, and this is possible only in the geometric formalism 
(see also [37, 38]). 
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Unfortunately, closed strings and superstrings are less developed than the 
bosonic open string. In the BRST formalism, for example, there exists con- 
siderable confusion over how to write the correct theory. For closed strings 
and superstrings, the ‘‘ghost counting” comes out all wrong. In fact, the naive 
BRST action vanishes exactly because it has the wrong ghost counting number, 
which necessitates a “consistent truncation” of either Q or the Hilbert space 
of the “ghost modes.” 

For the closed string [11, 39, 40], we have two duplicate copies of the 
-Virasoro generators L, and L,,. In the BRST formalism, the vacuum therefore 
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possesses double the previous ghost number because —} ~ $ = —1: 
|0) =|-)|-), (0) = (—IteI., (7.7.1) 


while Q remains with ghost number |. This means that the naive action must 
actually vanish 

(| O|%) = 0 eg.Z) 
because the ghost counting does not come out correctly, t.e., -l1+1—140. 
This is clearly undesirable. Various solutions to this ghost counting problem 
have been proposed, none of them totally satisfactory. For example, we can 


artificially truncate the zero modes within Q, keeping its nilpotency property 
intact. This means that the zero modes 


co; bo : Go; bo G73) 
will be arbitrarily removed and replaced by the smaller set 
Co; bo (7.7.4) 
such that the new operator O now decomposes as 
Q=6(K+K)+d+8+d+5—26(R + R). (7.7.5) 


Notice that the old zero modes have been dropped and a smaller set (which 
resembles the set used for the open string) has been inserted. 

In order to have the theory independent of the origin of the o coordinate, 
we must impose the constraint on the closed string: 


(K — K)|®) = 0. (7.7.6) 


The fact that we have a smaller set of zero modes means that we can use the 
vacua found earlier for the open string case: |—) and |+), 


|b) = ¢|-) +/+). (7.7.7) 
Notice that this truncation was chosen so that we retain the identity 
QO? =0. (718) 
The action is 
(P| Q|®). (7.7.9) 


Now let us turn to the supersymmetric case, where the situation is much 
worse. For the Neveu-Schwarz bosons, we actually have no problem with 
BRST zero mode ghost counting because the commuting ghosts y. f are 
half-integral moded, and hence do not change the nature of the vacuum. 
The BRST vacuum still has ghost number —!. Thus, the BRST action is 
still 5 


(A| QA). 910) 
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However, for the Ramond fermions [11, 41-53], there are severe problems 
with the integrally moded commuting Faddeev—Popov ghosts. Let yo be the 
zero mode of the ghost fermion oscillator. Then there is an infinite number of 
possible vacua because this ghost is bosonic in nature, not fermionic. Each of 
this infinite set of vacua is labeled by 


vo 10). ; Gaal 


This means that the field functional must be power expanded into a series of 
infinite vacua 


®(X, yo) = > bn(X)¥f. (2) 


Which vacuum do we use? This is extremely inconvenient, and it shows the lim- 
itations of the BRST approach. (A similar situation occurs in conformal field 
theory with “picture” changing operators. However, conformal field theory is 
an on-shell formalism, so all these pictures collapse. Here, the string field 
theory, by definition, is off-shell, and all these vacua must enter into the 
theory.) 

There are several proposals for truncating the theory in the BRST approach. 
We can either truncate Q, keeping it nilpotent, or truncate the field |®) so 
we do not have to sum over an infinite number of ghost vacua. Let us begin 
our discussion by introducing a unified notation by which we can describe the 
ghost fields for the Neveu-Schwarz and Ramond models simultaneously. Let 
us define the following [11]: 


Ay = (on, Un) . 
By = (bn, Ba), (eas) 
Cy = (Cn, Ya); 
where A‘, labels both the usual bosonic string and the anticommuting field y**, 
the dot reminds us that the fields can be integrally or half-integrally moded, 
and By and Cy represent the ghost fields for theory. In particular, b, and c, 
are the ghosts corresponding to the bosonic string, and f, and y, label the 
ghosts for the fermionic field, where the dot again can be either integrally or 
half-integrally moded. The commutation relations are 
AW Ay, —(—1)MN AX AK = 08M +N), 
ByGy (1) Gyby—o(M 4), (7.7.14) 
where (—1)”" is —1 is M and N are fermionic and equal to +1 


otherwise. 
Tee = Peer, then the BRST operator, in this notation, is 


O =C_yLy + Fyy:C_yC_1Bx:. (7.7.15) 


’ Where Ly is equal to the usual Virasoro operators. 


320 7. BRST Field Theory 


Specifically, for the Ramond string, the operator is 


O = cK yok +d Pie Rone 2 Rear ia oe (7.7.16) 
where 
Re MG 2aG,,, (7 
and 
F = Fy + f¥(C™ Bu — B_-u(—1)" Cal. 718) 


where fr equals 2 if M is bosonic and M/2 if M is fermionic, the bar 
interchanges fermionic and bosonic modes, and d, K, 6, R are equal to their 
counterparts for the Virasoro case. 

Fy is the usual Ramond operator, which satisfies the following commutation 
relations that define the algebra of Superdiff(5): 


{Gms Gn} = 2Lmin + 4D(m? — 4)8m.—ns 

[Lm, Gn] = (5m —1)Gmin, 

[Ls Fn] = (4m — 1) Finns (7.7.19) 
(Fins Fp VEQ2L ment 5 DMC Oman, 


D 
(Lin, Ln] = (nm — ny) Eman + re ai 
Notice that the algebra of Superdiff($), which is usually written in terms 
of physical fields, can be realized on the ghost fields By and Ca as 
well. 
The operators F, K, etc., satisfy, in turn, the following commutation 
relations: 


ad’ =§*=0, 

F? = K. 

2R =[F, R], (7.7.20) 
(d, 5} = F{F, R}. 


These commutation relations, in turn, are sufficient to show that the BRST 
operator Q is nilpotent. 

As we mentioned before, the problem with the Ramond sector is that 
the ghost mode yo is integrally moded and satisfies commutation, not an- 
ticommutation, relations, so there is an infinite number of ghost vacua 
described by y' |0) for each n. One way out of this problem is to arbi- 
trarily truncate the zero modes of the operator Q while maintaining its 
nilpotency. 

Now we reach the crucial step: as in the case of the bosonic closed string, 
we will arbitrarily eliminate the bosonic ghost modes yo. Bo and introduce 


~ 
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fermionic by and Co such that 
by |6) = 0, 
(6) eo 6) a0 (7.7.21) 


Let us define a modified Q that simply banishes the zero modes yo and fo 
from the theory. We define 


O = tF + (-1)"*(d + 8) — b(FR+ RF), (7.7.22) 


where Nr counts the total number of fermion creation operators in a state. 
Notice that Q? = 0, which was the guiding principle in defining this modified 
operator. 

To define the Ramond states, we define the GSO operator, which in this 
representation is 


C= lberecoll (in (7.7.23) 


where N- refers only to nonzero models and I"!! refers to the 10-dimensional 
chiral Dirac matrix. In this representation, the GSO projection simply 
eliminates states with the wrong spin statistics. 

An arbitrary Ramond superfield can be decomposed in this truncated Fock 
space as 


IY) = []v) + 20 6)1 6), (7.7.28) 


where |) is a zero-form of chirality P'! = 1 and |&) isa.G = +1(—1)-form 
and 


bo |0) =0, 
(0 Ge 0) =e (7.7.25) 
The truncated action now reads 
S=(v| QI¥), (7.7.26) 


where (w| = (W| 1°. The action is invariant under 
6|WV) = Q|A). (7.727) 
Substituting our previous decomposition of |W) in (7.7.24) yields 
S=(b| Fly) +(E| FR+RF IE) — (wld +51) —(§|d +5 |W). (7.7.28) 


The above action has also been found in other ways, such as making a clever 
choice in the truncation of |W) and preserving the original Q. This method 
also yields the same result. 
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The situation becomes much worse, however, when we discuss fermionic 
closed strings. In this case, the constraint [K — K]|) = 0 turns into 


[F — F]|w) =0 (7.7.29) 


which is a dynamical constraint. Hence, we are not able to apply this constraint 
to the states. 

Let us, however, present one more truncation operation [17] that holds 
promise of being local in o and hence generalizes to interactions. Let us 
recall the original ghost counting that was performed when we constructed 
the * and f operation. The ghost numbers were carefully chosen so that the 
gauge symmetry closed and the axioms of cohomology (7.5.8) to (7.5.12) were 
satisfied: 


Object Ghost Number 


oe ed 
Cen een bee en eee ra 


“> *O€ 


It’s easy to show that the above choice of ghost numbers satisfies the 
cohomological axioms that we listed before. 

Now let us generalize these rules for the open superstring case. We wish 
to maintain the cohomological axioms and in addition wish to enforce the 
conditions that two gauge transformations yield a third gauge transformation 
with the same ghost number. 

Given these conditions, we quickly find that the « and the / that we defined 
have the wrong ghost numbers. In fact, as we found before. the action is 
exactly zero. Instead of “truncating” the zero modes. as we did before, /es 
us use conformal field theory to supply us with the missing ghost numbers. 
These insertions from BRST ghost fields will be only at the midpoint, so we 
do not lose locality in o. Thus, this truncation, which truncates only ghost 
numbers of fields, is superior to the previous truncation. which truncates zero 
modes. 

Several new features must now be added to make this scheme work. 


(1) The superfield Y is now the sum of an NS field a@ and an R field w: 


VW = (a, w). (7:57:30) 
(2) Similarly, the gauge parameter A is also the sum of two pieces: 
N= WeY )s CIs} 


(3) The ghost number of * is now equal to 4 (because the conformal 
ghosts contribute ;, as before in (7.4.16), but now the superconformal 
ghosts contribute =: Thus, we must invent a new multiplication rule 
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for the superstring, which will have ghost number 3 (which we will 
call ®). 

(4) The integral { now has ghost number —i because of the contribution from 
the superconformal ghosts. We need to postulate the existence of a new 


integration ¢ which has ghost number —3. 


Now, by trial and error, we can construct the ghost numbers of all of our fields 


and our operators such that we satisfy closure of the gauge transformation and 
the previous axioms [17]: 


Object Ghost Number 


a 3 
" 0 
é = 
Xx —| 
2 eS (7.7.32) 
3 
® 5 
X i 
y =i 
fo 3 
Jf = 


where X and ¥ are still unknown but they have ghost numbers to fill in the 
missing numbers. 

Now that we have made an educated guess that successfully satisfies our 
original assumptions, we are faced with the task of actually calculating explicit 
forms for these fields and operators ® and ¢. . 

First, it is not a problem to construct BRST fields with these ghost numbers. 
We simply choose the correct vacuum (from among an infinite number of them) 
such that it has the correct ghost number. Then we can always apply the ghost 
number projection operator on each state and obtain a generalized BRST field 
with the correct ghost number. 

More difficult, of course, is defining the operations f and ®. 

Since * has only ghost number +, we need a new conformal operator with 
ghost number | and conformal weight 0. Fortunately, conformal field theory 
gives us such an operator: 


he One t (ia733) 
Now, our multiplication rule for gauge parameters must obey 


With this new X operator, we can satisfy this multiplication rule: 


Do ge == {6}, X1) Os, X2) 
= (X(€) * &2) + x1 * X2, X(E1 * X2 + X1 *E2)). (7.7.35) 
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By looking up the ghost number of each field and each operator, we find that 
all ghost numbers are correct. To see this, let us symbolically write this multi- 
plication rule in terms of ghost numbers. We want to multiply two multiplets, 
with ghost numbers (—3, —1), such that we get back the (— 3, —1) multiplet. 
If we carefully write all the various ghost numbers in this multiplication rule, 


we find symbolically 
(-3,-N@(-3,-)=(1-34+4-3=-14+5-1,1-34+3-) 
= (3, —1). (7.7.36) 
Thus, we have closure. 
Next, we wish to show that A ® A, which occurs in the variation of A, and 


also has the correct ghost number of A. We wish to calculate the ghost numbers 
of the following operation: 


(a, hf) @(e,x) = (X(axe) +, X(W xe +a x)). 1.7.37) 
By carefully counting the ghost number of each piece, we find 
—4,0)@(-3,-)p=(1-$+4-3=04+4-1,140+3-4) 
= (—}, 0), (7738) 


so we have closure again. 
Finally, we need to construct the new ¢ operator that has ghost number — 5. 


The naive integral operator has only ghost number — : , So We need an operator 
that has ghost number — 1. Once again, conformal field theory supplies us with 
such an operator: 


Y(o) =c_d,£e°*. (7.7.39) 


This operator is called the “inverse picture changing operator” and is. in some 
sense, the inverse of the operator X. We can show 


so that X and Y are inverses of each other. With this new operator, our new 
integration operator is defined so that 


fra. v)= / Y ($x) ay, (77AN) 


i.e., the integration rule is the same as before except that we insert a factor of 
Fa iI) at the midpoint. Notice that this operator is defined only at the midpoint 
of the string, so that we still maintain a theory local ino. 

With these new ghost counting operators, we can now show that the 
following action: 


S= $(A@QA+3A@A@A) (7.7.42) 
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is invariant under 
dA=QA+AQBA-AQA. (7.7.43) 


The advantage of this formalism is that it generalizes to the interacting case. 
All operators are local in o, so there is no problem in applying the continuity 
conditions on interacting strings. 

Let us also say a few words about a promising formulation of string field 
theory that makes no mention of the background metric and may give clues to 
the origin of geometry. This new formulation is called the “pregeometrical” 
approach [54-56] and yields some surprising conclusions. 

Let us begin with the action 


je we * Dx, (7.7.44) 
eye 
At first glance, this action makes no sense. The action makes no mention of 
the background metric of space-time, which is desirable, but two things seem 
to be fatally wrong with the action: it has no kinetic term and its equations of 
motion seem trivial. To study this peculiar action, let us write the equation of 
motion 


b+ d—0, (7.7.45) 


which usually has the solution ® = 0, so the theory is empty in a point particle 
theory. However, the previous equation is actually shorthand for an infinite set 
of coupled equations corresponding to an infinite-component field theory, so it 
is no longer trivial for a string theory. Assume, for the moment, that a classical 
solution ®o exists for the equations of motion which is nonzero. It must satisfy 


Dp * Do = O. (7.7.46) 
Now power expand the field around this classical solution 
® > Do+ eh C.T4T) 


and reinsert this back into the Lagrangian. Because ®p satisfies the classical 
equations of motion, we have the new Lagrangian based on a new classical 
solution 


2 
L 220 ab; « b) =, (7.7.48) 


So far, nothing seems new. We still have not made any contact with a physical 
theory. Now let us make the crucial assumption behind this approach. We 
assume that an operator D exists that satisfies 


D® = Dy * ® — (—1)°D * Do. (7.7.49) 


If such an operator D exists, then we can show that it is nilpotent. Now insert 
this expression (7.7.49) back into the Lagrangian, which now becomes 


L=Ox« D+ 220°. (7.7.50) 
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If we can make the identification that this D is equal to the usual BRST Q, 
then we have shown that the action is precisely the BRST action with a kinetic 
term! Thus, the usual BRST action might emerge when we expand about a 
new classical solution of the theory. 

The novel feature of this approach is that nowhere have we made any men- 
tion of the background space-time metric. The background occurs only in the 
kinetic term, not the interacting term. In fact, the choice of the background 
metric emerges when we expand about a classical solution to the equations 
of motion. This is why this approach is called the “pregeometrical” theory. 
In principle, the geometric of space-time should emerge as one among many 
possible vacua. 

In practice, however, we must be careful to check for the consistency of 
our approach. The key equation was (7.7.49). Under certain assumptions, it 
appears possible to find solutions for this equation where D satisfies the axioms 
for the BRST string field theory. In this case, it appears that the usual flat-space 
BRST string field theory is nothing but one solution of the ©? action. Other 
solutions of this action presumably will yield BRST theories defined with 
different classical metrics. 

For example, let us choose 


= OT. (i. 1) 


where Q, is the BRST operator defined only on the left-hand side of the 
midpoint (for open strings) and / is the identity operator (which equals | if the 
left half and right half of the string coincide and zero otherwise.) The original 
BRST Q operator is equal to Q; + Qr. If we make the substitution in the 
original action 


b= 0,1 +20. (7.7 Sm) 


we find that we recover the original BRST action (7.5.7). (The right-hand side 
of Q emerges in the manipulations because Org/ = —Q,1/.) 

Surprisingly, these definitions can be shown to be consistent with the original 
five axioms in Section 7.5, so that the usual BRST string action seems to be 
one among many possible solutions to (7.7.45). 

So far, it can be shown that flat space is a consistent solution to (7.7.45). It 
remains to be seen, however, what other kinds of classical backgrounds can be 
found as solutions to the equations of motion. 

Lastly, Ict us now discuss some of the new directions taken in string field 
theory. The first approach is to correct a hidden defect in the open superstring 
theory described earlier. The superstring field theory is actually anomalous 
[57]. For example, when calculating the four-point scattering amplitude, the 
three-string vertices meet on the world sheet, such that two picture changing 
operators X collide, yielding a divergence. We can show, using conformal 
field theory, that two picture changing operators, defined at the same point, 
yields an infinity. Not only are the amplitudes divergent when integrating over 
the moduli space of the disk, but the gauge symmetry itself is anomalous, 


if 
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since picture changing operators collide at the midpoint when multiplying two 
strings. Thus, the problem is hidden within the algebra itself. 

There have been some attempts to solve this problem by playing with the 
picture changing operators, by altering the ghost number of the strings, un- 
til the divergence disappears. Since all pictures must necessarily yield the 
same on-shell superstring amplitudes, one solution is to change the pic- 
ture in which the theory is formulated. In other words, the action, which is 
off-shell, may be defined in various pictures, but the on-shell S-matrix is 
unchanged. 

A second development concerns the closed string, perhaps the most urgent 
question facing string field theory. Naively, we might start with a closed string 
field theory such that all strings have the same lengths and then construct a 
cubic action. However, Kaku and Lykken [58] have shown that the resulting 
amplitude does not fully reproduce the Shapiro—Virasoro amplitude, i.e., there 
is a missing region of the integration region. They postulated the existence 
of a a four-string interaction necessary to restore modular invariance and the 
correct integration region. By unitarity, there are therefore an infinite number 
of such diagrams necessary to restore modular invariance, i.e., the action must 
be nonpolynomial. 

Finally, Kaku and groups at MIT and Kyoto [59, 60, 61] established the 
full nonpolynomial closed string field theory. Not only is the gauge invari- 
ance based on nonpolynomial interactions of closed strings, the action is also 
nonpolynomial, such that the complete modular region is recovered, piece by 
piece, when integrating over the moduli space of N-point functions. A general- 
ization of this bosonic string field theory to the superstring case, however, has 
been less successful. Because the closed string field theory is nonpolynomial, 
this magnifies the problem of colliding picture changing operators. Now, there 
are an arbitrarily large number of picture changing operators which can collide 
on the world sheet. 

A third development is the construction of a nonpolynomial closed string 
field theory for the D = 2 closed string case [62]. In two dimensions, string 
theory can be reformulated in terms of a Liouville theory. In two dimensions, 
the action retains its basic nonpolynomial structure, except that the states are 
much simpler and the ghost insertions at the midpoints are different, in order 
to satisfy ghost counting constraints. 

A fourth development concerns a geometric string field theory, one based 
on simple postulates which may unify all these various string field theories 
[63]. A geometric theory gauges the string length, so we can “interpolate” 
between Witten’s formulation (based on fixed string lengths) and the light- 
cone-like formulation (based on variable string lengths). This unifies the two 
BRST formalisms, which are now seen to be specific gauges of the same 
underlying theory. The first BRST theory is defined in the “midpoint” gauge, 
where strings interact at their midpoints, while the second BRST theory is 
defined in the “endpoint” gauge, where strings interact at their endpoints, as 
in the light-cone theory. 
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Lastly, we should conclude by saying that string field theory is still an 
evolving subject. As a result, it has not yet lived up to its promise of yielding 
nonperturbative information about string theory. Although string field theory 
is defined independent of perturbation theory, it is still too difficult to per- 
form nonperturbative calculations. By contrast, M-theory has already yielded 
a wealth of information about the nonperturbative nature of string theory, 
although in a nonrigorous fashion. 

It remains to be seen if M-theory can be formulated in a second quantized 
fashion, as in string theory. 


7.8 Summary 


The origin of the covariantized gauge approach was to construct a theory 
invariant under 


5 |) = ee IAn)- (7.8.1) 
n=1 


We postulated this invariance based on an analogy with the Yang—Mills theory, 
which is also invariant under a field that rotates into ghost fields. In fact, if we 
decompose the previous equation, we will derive precisely the linear part of 
the variation of the Yang—Mills theory 


(2 Pleo Pe 
where P is a projection operator. By a power expansion, we can find the exact 
solution to this gauge problem 
(®| (Lo — $L,L_1 +---)|®). (7.8.2) 


At the free level, this reproduces the usual Maxwell action. The solution can 
also be written to higher orders by brute force 


P(Lo — 1)P = Lo — 1 — $L_)L) + L2,,(4L9 + 4D — 9)AL} 
+ L? ,(6Lo + 6)AL) + hic. 
— Leg(4Lo + 2)\Qhact 2)Adig + :--, (7.8.3) 
where 
A = 2(16L2 + (2D — 10)Lo + D)"! (7.8.4) 


and P is a projection operator that eliminates ghost fields. Notice that this 
theory is nonlocal, meaning that we must incorporate auxiliary fields to soak 
up the nonlocal terms. 

The necessity for auxiliary fields already shows up in the BRST quantization 
approach. Notice that the Faddeev—Popov ghosts propagate in the free theory: 


P(Ox + 8:)0' + p5(e — 3). (7.8.5) 
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This means that our path integral must be altered to take into account these 
extra degrees of freedom. The Hilbert space of our field functional must now 
be expanded: 


i312) ix) | DxD6 (IEG. le 
(O| (X|®) = ®(X, 8). (7.8.6) 
This means that our Green’s functions must now be expanded to read 


X).8) 
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(7.8.7) 


The BRST quantization introduces a new nilpotent operator Q such that 


CO 
O = cp(Lo — 1) + ) [ephg tb bncal 
=) 


(oe) 


oe ‘CnC _pOnamin — Nn), (7.8.8) 


Q’ =0. (7.8.9) 
Thus, our free action can simply be expressed as 


L = (%|Q|®), 
Sols (7.8.10) 


The generalization of this to the interacting theory can also be performed 
using Neumann function techniques. We need a conformal mapping that will 
take the upper half-plane into a fully symmetric configuration. One choice is 
to splice together the mapping of six charges to produce the three-string vertex 


+3 =, 
we be eZ) =n “ = ~in/2, (7.8.11) 
QQ = a2 = a3 = Ilr 
y= a7 — a; — |, 
2) = eit/6 zy = edt! (7.8.12) 
oy = err Z-1 = —Z1; 
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This allows us to write an explicit formula for the vertex function: 


|Vio3) = exp Ea St Panne oe 3 | |O123) . 


f—I 
(7.8.13) 
One major advance in the BRST theory is a formalism where everything can 
be reduced down to the basic assumptions of the theory. It turns out that with 
five axioms, which are simply postulated with no derivation, we can derive the 
entire BRST theory: 


(1) Existence of a nilpotent derivation 


Q? =0. (7.8.14) 
(2) Associativity of the * product 
[Ax B]*C=Ax*[Bx*C]. (78:13) 
(3) Leibnitz rule 
O[A * B] = QA* B+(—1)*A * OB. (7.8.16) 
(4) The product rule 
faxp=co [aa (7.81 
(5) The integration rule 
i OA=9. (7.8.18) 


With these rules, we can now show that the invariant action is 
L= [ ox90+30x0x® (7.8.19) 


which is a Chern—Simons term. 
Lastly, we can show that the three-string vertex function is equivalent on- 
shell to the usual Veneziano vertex: 


|W 123) = [oo oases |v. (7.8.20) 


n>0 


Thus 
(phy|V\23) = (phy| V 123). (7.8.21) 


The embarrassing thing, however, is that we can also write the covariantized 
light cone vertex and show that it is also equivalent on-shell to the usual 
Veneziano vertex. Thus, we are faced with two equivalent BRST vertex 
functions, both of which produce the same on-shell vertices. 

Although the open bosonic string was relatively easy to write in BRST 
language, the superstring and closed string have been much less successful in 
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the BRST approach. The problem is that naive ghost counting of the superstring 
and closed string actions yields the wrong results: 


(b| Q|®) = 0. 


In order to change the ghost counting, there have been several proposals to 
“truncate” either the Hilbert space or the Q operator so that the correct ghost 
counting is achieved. The problem with this approach is that the zero modes 
are treated differently from the other modes, so that locality in o is probably 
violated, and hence these approaches probably cannot be generalized to the 
interacting case. 

A more promising approach is to introduce bosonized ghost operators that 
have the correct missing ghost numbers and insert them at the midpoints where 
we do not lose o locality. 

Lastly, we mention some new directions: 


(a) A background-independent string field theory apparently may be con- 
structed by simply deleting the kinetic term of the action. 

(b) The open superstring action is actually anomalous because of colliding 
picture changing operators. This problem may be solved by changing the 
picture in which the action is defined. 

(c) A bosonic closed string field theory (with fixed string lengths) is neces- 
sarily nonpolynomial. The gauge-invariant string field theory action for 
bosonic closed strings has been explicitly constructed. However, it is not 
known how to generalize this to superstrings. 

(d) The action for D = 2 closed string field theory has been constructed which 
reproduces Liouville theory. 

(e) A geometric string field theory has been developed in which in the string 
length has been gauged, so that Witten’s string field theory emerges in a 
“midpoint gauge,’ while the light-cone-like theory emerges in an “endpoint 
gauge.” 


The power of string field theory is that it is formulated non-perturbatively 
in a simple action. However, the theory has not yet lived up to its promise. 
The theory is well defined but is still too difficult to extract nonperturbative 
information. By contrast, M-theory, although it is ill defined mathematically, 
has yielded a wealth of nonperturbative information. It remains to be seen if 
M-theory can be formulated in a second quantized language. 
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Part Ill 


Phenomenology 
and Model Building 


CHAPTER 8 


Anomalies and the 
Atiyah—Singer Theorem 


8.1 Beyond GUT Phenomenology 


Ideally, we would wanta truly unified theory of all known interactions to satisfy 
at least two criteria: 


(1) It must be based on simple physical assumptions, expressed in terms of a 
new geometry, which will allow no more than one coupling constant. 

(2) It must yield a finite theory of gravity coupled to the minimal SU(3) ® 
SU(2) ® U(1) model of particle interactions. 


So far in this book, we have only begun to explore the first possibility 
by demonstrating that a second quantized field theory exists that is based 
on two physical principles. However, the developments we have presented in 
string theory have been purely formal. Unless we can make contact with the 
known experimental data, then the theory, no matter how elegant, must be 
discarded. The true test of a unified field theory is that it can reproduce the 
known experimental data at low energies. 

The problem, however, is that dimensional breaking from a 10-dimensional 
theory down to four dimensions can occur only nonperturbatively. To any 
finite order in the perturbation theory, the dimension of space-time seems 
perfectly stable. In general, field theory is the only reliable formalism in which 
to perform nonperturbative calculations, because the first quantized formalism 
is necessarily perturbative. Unfortunately, we do not yet understand how to 
perform realistic nonperturbative calculations in string theory, mainly because 
the field theory of strings is still in its infancy. Thus, physicists have not been 
able to calculate the stability of any classical vacuum solution. Hence we will 
not deal with the question of quantum stability in Part III of this book and will 

’ concentrate exclusively on classical solutions of the equations of motion. 
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Given this important restriction, it is surprising that initial attempts at ex- 
ploring the experimental consequences of classical string theory have produced 
a wealth of new phenomenology that takes us beyond GUTs. In Part III, we 
will first ask whether the string theory is compatible with the results of the 
standard GUT theory. Specifically, we will ask whether it can reproduce the 
GUT theories of SU(5), O(10), or Eg. In this respect, the string theory has had 
some measure of success. We will show in Chapter 10, for example, that the 
Ex ® Eg heterotic string can easily be broken down classically to E,, which 
has chiral fermion solutions and an acceptable GUT phenomenology. 

However, we must also demand that the string theory go beyond the standard 
phenomenology of the GUT theory. Specifically, we must ask the following 
questions of the string model: 


(1) Can it explain three generations of chiral fermions? 

(2) Can it explain the experimental results on proton decay? 

(3) Can it explain the smallness of the electron mass? 

(4) Can it explain the vanishing of the cosmological constant after supersym- 
metry breaking? 


Although it is still too early to say, there are indications that the string theory 
is rich enough in content that it contains mathematical mechanisms which 
may answer the above questions. Specifically, new topological mechanisms 
enter into the picture, such that basic phenomenological concepts. such as 
generation number, are now recast in a new topological language. Jopology 
is the crucial new mathematical feature that allows us to go beyond standard 
GUT phenomenology. 

We will begin this chapter with a discussion of the isopin groups that are 
allowed in string theory. We will find that the properties of the S-matrix, such 
as cyclic symmetry and factorization, provide only the weakest restrictions 
on the gauge group of the theory. Then we will show that the cancellation of 
anomalies in string theory places stringent conditions on which gauge groups 
may be allowed by the theory. Basically, the gauge group of a supersymmetric 
theory must contain exactly 496 generators, which restricts us to either SO(32) 
or Eg ® Eg. 

To understand how the cancellation process takes place. we will find it con- 
venient to review some of the elementary properties of characteristic classes. 
In particular, new developments in supersymmetry have now made it possible 
to prove the Ativuah—Singer index theorem trom a simple Lagrangian. Tradi- 
tionally, the proof of the Atiyah -Singer theorem has been inaccessible to most 
physicists because of the intricacies of the mathematical formulation. 

We begin our discussion of phenomenology by introducing isospin into the 
model via Chan—Paton factors [1]. Ever since the early days of string theory, 
it was known that isospin factors can be introduced trivially into the model 
by a simple multiplicative factor. (In the next chapter, we will discuss a much 
more sophisticated way to introduce gauge groups via compactification and 
Kac—Moody algebras.) 


8.1 Beyond GUT Phenomenology Boo 


The Chan-Paton method produces the scattering amplitude T by simply 
multiplying the Veneziano—Born term A by the trace of isospin matrices, both 
of which are manifestly cyclically symmetric, and then summing over various 
permutations of the external legs: 


P28 NS > WA Aedes adnA( lee 3, cme). (8.1.1) 


perm 


Because the trace is cyclically symmetric regardless of the isospin group, we 
have no restrictions on the choice of the group itself. Thus, we wish to impose 
additional constraints that will allow us to eliminate some of the nonphysical 
choices of the gauge group. We first insert a complete set of intermediate states 
into the scattering amplitude. We then demand: 


(1) that the amplitude 7 factorize explicitly; 

(2) that the twisting of external legs yield an eigenvalue of +1; and 

(3) that the massless Yang—Mills particle on the external legs and in the in- 
ternal factorized channel belong to the adjoint representation of the gauge 


group. 


Let us begin by imposing the first condition, that the amplitude factorize 
explicitly. In Fig. 8.1, we divide up the N external particles into two clusters, 
a and b, and let the intermediate line represent the particle X. The left-hand 
piece represents the scattering of particles a into X, while the right-hand piece 
represents the particle X breaking up into particles b. Following (5.1.7), we 


ane 


FIGURE 8.1. Constraints imposed by unitarity. Isospin can be introduced into the 

model by multiplying the amplitudes by Chan—Paton isospin factors, which must 

obey stringent conditions arising from unitarity. The factorized scattering amplitude 

must consist of the sum of Chan—Paton factors where the external lines are arranged in 

both clockwise and counterclockwise order. (X represents a collection of factorized 
, States.) 
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find 


BGl 2. N= ~ YoT(a> X)T(b> X)+---. (8.1.2) 
Xe 


2 
X 


Now let us reexpress this factorization formula in terms of Veneziano 
amplitudes. Each of the amplitudes obeys 


I 


A(1,2,...,N) = ; 


nee CA aia meer 
Me 


a3) 
Notice that the Veneziano amplitudes are automatically factorizable and cycli- 
cally symmetric, so the factor T(a — X) must contain two terms, with both 
cyclic and anticyclic ordering of the external legs. Thus, the product of two 
T’s must contain 2 x 2 terms altogether. Let us now write these four terms 
explicitly: 


l 
Ta a soa¢ N) = e a {Tr(A,A2 ans *Ap—-1ApAp+1 oe -An—1ANn) 
mer ex 


XA, 2,025) — 1, Pep +1). ..,.0 — ae 
+ Tr(ApAp-1-** AzAtA pi 1 Apya* > * Aw) 
x Alp, p— ly... 2, hex 
x A(X, p+1,p+2,...,.N—1,N) 
+ Tr(AjA2 °°: ApANAN-1 °**Ap4i) 
x AC. ne Pa AUK Nis vox to 
+ Tr(ApAp—1-+: AtAwAw—1 ° ++ Ap4t) 
RAC Det ly A JACK, NG sag) oe he (8.1.4) 
Next, we wish to simplify this expression. We will now impose our second 
assumption, that the twist operator corresponds to 
9 ea (Pa ase (S13) 


Notice that if we twist all external legs, this changes the cyclic ordering into 
an anticyclic ordering. By assumption, we are considering only the scattering 
of massless vector particles, which have N = 0. Thus, the twist operator picks 
up a factor of (—1) for each external leg. Thus. the total contribution to the 
twist operator is 


AQ, 2.05.. 7; X) = (— 1a, po 1h 2): (8.1.6) 


Now let us collect all terms and arrive at some conclusions. After collecting 
terms, we find 


Tr{[Ayaz Ap — (—1)PApAp-1-°- Aa] 


X [ApstApza*+ Aw —(—1L)Y PANANat Ap+t}} 
mA], 2..... X)ACX, pee l,.... N). (8.1.7) 
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Now let us insert a complete set of isospin matrices A, in the above equation. 
We can always do this, so we take 


bap = Tr(AqAg). (8.1.8) 
Then we can write 
Tr{{Q1 non Ap =r De, aad -Ay] 
x [Apsti es Aw — (—1)" PAs Apa ]} 
= SO Tr{[Ay «+ -Ap—(-1)PAp + Ar DAad 


x TralApers**Ay —(—1)¥ Paw -+-Apeill- — (8.1.9) 


At this point, let us impose our third and final condition. The third criterion 
becomes the fact that the combination of lambda matrices in the brackets must 
in turn be part of the algebra of the group: 


A= Aye**Ap — (—1)P Ap - A. (8.1.10) 


Our conclusion is that the Chan—Paton factor, in order to preserve the adjoint 
representation for the massless vector particle, must have a gauge group for 
which the above combination of generators is also a generator of the algebra. 
This necessarily restricts our matrices to Usp(N), SO(N), and U(N). (Actually, 
U(N) is also ruled out, but for other reasons. We cannot consistently couple 
open and closed superstrings with U(N) because N = 2 supergravity cannot 
couple to N = | matter multiplets.) 

Unfortunately, this analysis does not yield any more constraints, so the model 
has very little predictive power. We will now look at the question of anomalies, 
which will fix the gauge group to be either SO(32) or Eg @ Eg. As in the case 
of gauge theories, where the cancellation of anomalies between the quarks and 
leptons plays a central role in model building, the cancellation of anomalies 
will play an important rule in fixing the gauge group of the string model. 


8.2 Anomalies and Feynman Diagrams 


Anomalies have a very profound origin [2-4]. They shed light on the deepest 
dynamics of quantum field theories. 

Simply, anomalies arise whenever the symmetries of the classical action do 
not carry over to the quantum level. The classical symmetries do not survive the 
process of regularizing the quantum theory. There are two types of anomalies, 
global and local. 

Global anomalies in gauge theories are actually welcomed. For example, a 
global anomaly in scale invariance in QCD with massless quarks might be re- 
sponsible for the generation of quark masses. Thus, a breakdown in global scale 

’ invariance through anomalies might be the origin of quark masses. Another 
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global anomaly might be responsible for the breakdown of U(N )rp @U(N ), in- 
variance that occurs in quark models of QCD down to SU(N) @ SU(N), which 
would be phenomenologically desirable. For the superstring model, however, 
global anomalies would be undesirable. For example, if global anomalies vi- 
olated the modular invariance of the multiloop amplitudes, this might have 
disastrous consequences for the internal consistency and finiteness of the 
theory. Fortunately, it can shown that this global anomaly in modular invariance 
is absent in the string theory [5]. 

By contrast, local anomalies in gauge theory and in superstring theory must 
be canceled at all cost, or else these theories make no sense quantum mechan- 
ically. For example, the elimination of chiral anomalies is one of the essential 
ways in which to build new models of quarks and leptons. In the standard model, 
the quarks and leptons have chiralities that just cancel the chiral anomaly. In 
superstring theory, local anomalies in conformal invariance and chiral symme- 
try must also be eliminated. The elimination of the conformal anomaly fixes 
the dimension of space-time and also the fermion content of the theory, while 
the elimination of chiral anomalies will fix the gauge group of the theory. 

We begin our discussion by examining the simplest local anomaly. the 
chiral anomaly, which occurs because the process of regularization (such as 
Pauli—Villars or dimensional regularization) does not respect chiral invariance. 
Specifically, if we have a theory with the invariance 


p> eh y, (8.2.1) 


then we would expect classically to have a conserved chiral current trom 
(1.9.8): 


0, J4> = a, (wysy4w) = 0. (8.2.2) 


However, there are complications due to quantization. The Pauli Villars 
method, for example, inserts an imaginary massive particle into the theory 
to make all Feynman diagrams converge 


1 1 i M2 —m-* 


p? +m? p?+m? p?+M? — (p?+m?)(p?+ M?) ( ) 


Propagators, which usually converge as p~°, now converge as p~*, thus making 
almost all divergent Feynman diagrams convergent. Once we have obtained a 
finite S-matrix, we set the mass M of the imaginary particle to go to infinity. 
However, mass terms explicitly break chiral invariance: 


Sw) £0. (8.2.4) 


Thus, the Pauli Villars regularization program does not respect this symmetry, 
and we expect that the divergence of the axial current is not conserved. He 
cannot preserve both conservation of the current and regularization of the 
Feynman amplitudes at the same time. But since the regularization of the theory 
is more important (or else we have no theory), this means that we must sacrifice 
current conservation. 
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Similarly, dimensional regularization will also fail to preserve the chi- 
ral anomaly. Dimensional regularization assumes that we can analytically 
continue the Feynman diagrams into complex-dimensional space by the 
substitution 


3 D-1 
DE een Pa 
nS) A=0 
i} cp / d? p. (8.2.5) 


It is easy to generalize the traces over Dirac matrices for vector particle cou- 
plings, but this breaks down for axial vector couplings because there is no 
generalization of ys; in complex space: 


Vs 7 Vp41- (8.2.6) 


The dimensional regularization program breaks down for chiral fermions 
because this regularization cannot generalize the ys; matrix to a complex 
dimension. 

We expect, therefore, that the divergence of the axial current is not conserved. 
In fact, it is the vector—vector—axial vector (VVA) diagram that explicitly breaks 
the conservation of the axial current. The triangle diagram poses regulariza- 
tion problems because each internal fermion line, which circulates within the 
triangle diagram, converges only as p~', which is not sufficient to give us 
a convergent graph and hence has ambiguities. By carefully regularizing the 
triangle diagram, we wind up with 


=i 
i) ie jee in pee as (8.2.7) 
which is a total derivative or a topological term, given by 
l 
- ER Tr (AgdgAy + 3AaAgAy)- (8.2.8) 


This result for the triangle graph was derived in four dimensions. However, 
the anomaly can also be generalized to D-dimensional chiral theories, where 
the triangle graph is now replaced by a polygon graph. Let us begin by writing 
the Feynman rules for external vector and axial vector particles interacting with 
a spin-+ fermion. Let us begin with N — 1 external vector particles and one 
axial vector particle of external momenta k", polarization ¢/(k), and isospin 
indices a, b, and c are interacting with a fermion of internal momenta p;'. The 
Feynman rules are 


p 
oy 
p? 
vertex (vector) > I" - fAg, (8.2.9) 
vertex (axial) > [p41T - Ag. 


propagator > 
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We assume that the external particles are on-shell, that the polarization vectors 
vanish when contracted on external momenta, and that the sum of the external 
momenta is zero, so that 


k? , =O, 
ki - (ki) = 9, (8.2.10) 


N 
Rs = 0). 
i=] 


Then Feynman’s rules for the polygon graph give us 


N 
T- p,T -¢; 
G~ f dp Tg ta TP PEEL + Pou . €B2611) 
i! i 


t 


Let us extract the most divergent part of this diagram. We notice that the trace 
over the term containing the 'p,; can be explicitly carried out 


Try, ie Pup Pps) Euipo-up* (8.2.12) 


Notice that the antisymmetric matrix arises only when we take the trace of DI 
matrices multiplied by p;,. Let us recombine the various factors of momenta, 
which must now be contracted onto the antisymmetric matrix. Notice that the 
conservation of momentum reduces the number of independent momentum 
factors by one. In general, the only terms that survive are of the form 


N 
] | 
G~ [arp tro. Rand TT peur ge Ree. (8.2.13) 
i—! i 


Notice that there is now a direct correlation between the number of external 
lines and the number of dimensions of spacetime because the antisymmetric 
tensor is of rank D. Thus, the leading divergence must satisfy the relationship 


D=2N —-2. (8.2.14) 


In four dimensions, this means the triangle graph is divergent. In 10 dimensions, 
this means the hexagon graph is divergent. Notice also that this gives us an 
explicit expression for the anomaly term. The external momenta &, ,, can be 
replaced by derivatives 0,,, and the polarization tensor ¢” can be replaced 
by the field A”, so that we can replace the contractions over Aig.) With a 
contraction over the Yang Mills tensor F,,,. Thus, after taking the Fourier 
transform, we find the correspondence 


Ka Os 
Cu Tm ae (8.2.15) 
Ajai me Fai 


~ 
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Combining all factors, we find [6-8] 
j(1/2)D 


Oy, Je? = D-l-(1/DD/1 
2P-\ 7701/2) (Ze): 


ee ee COMED (8.2.16) 
Similarly, there is an anomaly in the theory of gravity coupled to chiral 
fermions, which again is proportional to a total derivative or a topological 
term. If we couple external graviton legs to chiral fermions circulating in 
the interior of a single-loop graph, we can repeat our previous analysis with 
Feynman diagrams. The major difference comes from three factors: 


(1) We now must include the coupling of fermions to the energy-momentum 
tensor rather than the chiral current. The graviton coupling to fermions 
takes place through vierbeins (because GL(N) has no finite-dimensional 
spinor representations; see Appendix). 

(2) The external polarization vector ¢,, now becomes an external polarization 
tensor ¢,,,. 

(3) The vertex functions now contain higher tensor components. 


However, the leading divergent part of the diagram still contains the €,,, 5... 
tensor contracted onto external momenta k; and external polarization tensors 
Ey». Once again, it is trivial to recombine all factors and show that the leading 
divergence can be reassembled in x-space in terms of curvature tensors. For 
example, the coupling of an axial vector and two energy—momentum tensors 
in four dimensions yields 


va. a 
D, Jt? = — FG ie Mya sta (8.2.17) 


In the theory of gravity, there are two well-known total derivative terms, in 
two and four dimensions. First, there is the celebrated Gauss—Bonnet identity: 


1 
x(M) = 7 i} a’x,./gR = 2 —2g, (8.2.18) 


which we will prove later, where x(M) is the Euler number of a two- 

dimensional manifold M, R is the contracted curvature tensor, and g is the 

genus of a closed manifold (i.e., the number of holes or handles). This identity 

is possible because the scalar curvature in two dimensions is a total derivative. 
In four dimensions, we have the identity 


eH Tr Ryy Rap) = ASB Ro vgp + OVSBRi, + OVER 
= total derivative (3:29) 


for specific numbers a, b, and c. 

Usually, the integral of a total derivative is zero. However, this is not true if 
the manifold has a boundary or nontrivial topology. In this case, the integral 
over a total derivative picks up a term from the boundaries and the topologies. 
Thus, the integral over the contracted curvature tensor for a closed manifold is 
related to a linear function of the genus, or the number of holes in the surface, 
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which is a topological number. This is the origin of the relationship between 
anomalies, total derivatives, and topological numbers. 

Thus, both the gauge and gravitational anomalies can be derived from Feyn- 
man diagrams. However, for higher dimensions this becomes prohibitively 
difficult. Indices proliferate very rapidly for higher dimensions. Instead, to 
generalize these results for higher dimensions, we will derive these expressions 
from the functional formalism without appealing to Feynman diagrams. 


8.3 Anomalies in the Functional Formalism 


So far, we have been approaching anomalies in bits and pieces. What we 
would like to have is a systematic method with which to calculate all such total 
derivative terms for an arbitrary gauge group and arbitrary dimension. 

Let us begin with the functional expression for the fermion interacting with 
the external vector or gravitational particles: 


[ ovdies = [ ovow exp |- { [ a? 4 rovbu | 
(8.3.1) 


where the covariant derivative explicitly represents the coupling of the fermion 
to the external Yang—Mills and gravitational field: 


D =T*(0, + id7AS + wi? M%”), (8.3.2) 


where I’ are the gamma matrices in D dimensions multiplied by the vierbein 
field, and M“’ is a generator of the Lorentz group O(D — 1, 1). 

Notice that the functional integral is quadratic in the fermion fields, so 
the integral is a Gaussian. Using our results on integration over Grassmann 
variables in the Appendix, we can now perform the integral and obtain a deter- 
minant. Then we can reexponentiate the determinant back into the exponential 
to obtain a term (A, g) that is explicitly dependent on the gauge fields. By 
explicitly performing the Gaussian integration in (8.3.1), we find the following 
determinant: 


ef 48) = det(t(1+Tp+1)P), (8.3.3) 
where A and g are the Yang—Mills and graviton fields. We now usc the relation 
det Me™, (8.3.4) 


(This is most easily proved by making a similarity transformation that diago- 
nalizes the matrix M. Then M reduces toa diagonal matrix with its eigenvalues 
along the diagonal. In this form, the identity is trivial to prove. Finally, we make 
the reverse similarity transformation to restore the M-matrix.) Thus, we can 
take the log of (8.3.3): 


P(A, g) = In{det($(1 + Tp41) P]} 
= TrIn{$(1 + Pp41)D}. (8.3.5) 
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Now let us calculate the gauge variation of this field functional. Under the 
gauge transformation 


A, => A, — D,€, (8.3.6) 
we find that, after a Taylor expansion in ¢, a field functional transforms as 
P(A;,) = F(Ap — Dye) 
= I(A,)= [are de +5. 


m 


ér 
=T(A,)+ | d?xTr Du eee 
i 


= r(A,)+ fax Tred, J" ste (8:3.7) 


In the last expression for I"(A’,), we used the fact that the current generated 
by a gauge transformation is given by 


dI(A) 
= ; oF 
rs (8.3.8) 
We can rewrite the G appearing in (8.2.13) in functional language: 
) 
G = D,——T(A,). S50 
“5A, (Ay) (8.3.9) 
Then 
Di =. (8.3.10) 


Thus, nonconservation of the axial current means a nonvanishing G. 

The anomaly must also satisfy a self-consistency condition, the Wess— 
Zumino condition [9]. Notice that the generator of a gauge transformation 
is equal to 


: B= 5 = Pag (8.3.11) 

Then we have a new way of writing G: 
Gi) = 0,1 (Ay): (33,12) 
But we also know that the local gauge generators form a closed algebra. Thus, 
Suu One| = On, - (83213) 


Then this means that the G’s must satisfy the constraint 
6,,G(Az2) — (1 < 2) = G(A3). (8.3.14) 


If we write G as a variation of '(A,,), then it is obvious that the Wess—Zumino 
’ consistency condition is satisfied. However, if we write the anomaly term in 


348 8. Anomalies and the Atiyah-Singer Theorem 


terms of curvature tensors, then it is highly nontrivial that this self-consistency 
condition is satisfied. This then provides a powerful check on our computations. 

Now that we have a functional formalism in which to discuss anomalies, let 
us begin a discussion of the mathematics behind anomalies. Specifically, we 
will show that the anomaly can be written in terms of generalized characteristic 
classes that have been studied by mathematicians. Then we will show the most 
elegant part of the theory, namely that the integrals over these characteristic 
classes yield various index theorems. Thus, our strategy is to show 


Anomalies —> Characteristics classes — Index theorems. 


8.4 Anomalies and Characteristic Classes 


To study characteristic classes [10, 11], which will give us a systematic anal- 
ysis of all possible topological terms, we will use the language of forms (see 
Appendix.) The theory of forms, in some sense, only produces results that can 
be derived using the usual analytical methods of calculus. However, in higher 
dimensions, the number of indices rapidly proliferates beyond control. Thus, 
the powerful shorthand notation of the theory of forms allows us to rapidly ma- 
nipulate tensors of arbitrary rank in any dimension, which is prohibitive using 
standard tensor calculus. In Chapter 10 we will see that the theory of forms is 
the most convenient language for the theory of cohomology and homology. 

Armed with the theory of forms, we now construct a series of characteristic 
classes that will allow us to write by inspection the set of all topological terms 
for practically any dimension and any group. 

Let us now define an invariant polynomial that has the property 


P(aw) = P(g” 'ag), (8.4.1) 
where @ and g are group matrices. Examples of invariant polynomials are 
Det) +-a@), Tre*. (8.4.2) 
Now let &2 be a curvature two-form: 
&=dwo+oro (8.4.3) 
that satisfies the Bianchi identities 
dQ+oA2—(-1l"°QAe=0. (8.4.4) 


In the Appendix we prove two important statements for invariant polyno- 
mials: 


[1] aP(S)=0; 
[2] P(Q) = dQ, (8.4.5) 


for some Q. That is, an invariant polynomial based on curvature forms is both 
closed and exact. The theory of forms allows an explicit construction of the 
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Q. For example, in the Appendix we show 


1 
TrQ" = nd / dtt™' Tr[A(dA +tA’)""}} 
’ 0 
= d@n-1. (8.4.6) 


Thus, the trace of the product of curvature two-forms is exact and closed. This 
is a generalization in the language of forms of previous identities that were 
proved by hand. For example, in four dimensions we have (8.2.7) and (8.2.17). 
For arbitrary dimension we have (8.2.16). This new form w,_; is called the 
Chern—Simons form, which we encountered in (8.2.8). 

This is important, because earlier we showed that in two and four dimensions 
there are polynomials in the curvature tensor that are total derivatives. But 
we were unable to construct these higher topological terms. Equation (8.4.6) 
summarizes the solution to the problem. 

Using the language of forms, we can also find a convenient expression for the 
Wess—Zumino consistency condition. In D-dimensional space, let us formally 
define a product of curvature forms that we call /p+2. (Ofcourse, this is actually 
zero for D + 2 dimensions, but we will put aside this point.) This form, by 
construction, is exact, 


Ip42 = d@p41, (8.4.7) 


where wp. | is a Chern—Simons form. /p,. is also gauge invariant because it 
consists entirely of curvature forms. The fact that it is both gauge invariant and 
exact thus leads us to write 
balpat = Sadwp4i 
= db,@p41 
= a wp41 
== 0), 


Thus, we see that the gauge variation of the Chern—Simons form is closed. We 
can write locally 


b,@p+} _ dWp. (8.4.8) 


The purpose of this construction is to find a form for the anomaly G such that 
it is manifestly a gauge variation of some other form. We now write 


M 


Notice that we can use Stokes’ theorem to show 


| op = f doy = 6s f wos (8.4.10) 
M D> a 


-where the surface M and © are related by M = 0X. 
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This is our desired result. We have now shown that the anomaly term G is 
a gauge variation of another form. Thus, G automatically satisfies the Wess— 
Zumino consistency condition, because 


Bn, G(A2) = Bn,Ba: f oer (8.4.11) 


We will use this construction when we actually cancel the anomalies in the 
superstring. 

Armed with the theory of forms, we can now construct the characteristic 
polynomials at will. 

We will find that there are four major characteristic classes: 


(1) Chern classes; 

(2) Pontrjagin classes; 

(3) Euler classes; and 

(4) Stiefel-Whitney classes. 


Let us now restrict the curvature form 2 to be a member of GL(k, C), i.e., 
an arbitrary k x k matrix with complex elements. Let us define the roral Chern 
form over this group as 


c(82) = det (1 ae 5-0) =e ot2je =. (8.4.12) 
where we have power expanded in powers of £2. The terms in the expansion, 
which eventually terminate, are 

co = 1, 


I 
eq = —TrQ, 
20 


l 
C2 = — (Tr(Q A Q) — Tr(22) A Tr(Q)), 
872 
= -= (-2TV(QAQAQ)4+3TK(QAQVATrQ = (8.4.13) 
— Dese A TES2-A Tr 92), 


ae (=) det Q, 
21 


dc;(Q) =0. (8.4.14) 


C3 


where 


Notice that the series eventually ends, because Q raised to a high enough 
power is equal to zero because the dx‘ are Grassmann-valued. In addition to 
the Chern classes ¢,, we can also introduce the Chern character: 


ee (8.4.15) 
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We can always diagonalize the matrix Q so that it has only eigenvalues x; 
along the diagonal. Thus, we can always find a matrix § such that 


x\ 0 0 0 
0 X2 0 0 
SQS7! = (-20i 
(2) | aa m0 (8.4.16) 
0 0 O 
After diagonalization, the traces and determinants become trivial: 
c(2) =|] +). (8.4.17) 
j=l 
Notice that the topological invariant for SU(2) in four dimensions is simply 
M) = det | 1 + —A,F* }=1+—<Tr(FA F). A. 

c(M) et ( aarp ) + 3 r(F AF) (8.4.18) 


In addition to the Chern classes, there are the closely related Pontrjagin 
classes. Let Q be a member of O(k), and then define 


| 1 
p(2) = det (1 = 2-21) = 14 pt pate (8.4.19) 


The Euler class, however, is defined slightly differently. It is not based on 
the determinant, like the Chern and Pontrjagin classes, but on the Pfaffian. Let 
a be a two-form: 


a = taj; dx' Adx!. 
Then raise it to the rth power: 
a’ =r! (20 e(a)dx! Adx* A+++ A dx”. (8.4.20) 


Notice that this is an antisymmetric matrix. We cannot diagonalize an an- 
tisymmetric matrix such that its diagonal elements are its eigenvalues x;. 
However, we can write it in the form 


0 x| 0 0 


OSS Xe Ae (8.4.21) 


The invariant e(a), which is built out of the antisymmetric matrix q, is called 
the Pfaffian. In terms of these x’s, the Pfaffian is equal to 


e(M) = Xj X2°°*X2,- (8.4.22) 
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We can also conclude the Pfaffian e(M) is the square root of the highest 
Pontrjagin class: 


p(M) = x7x2 ---x? = e?(M). (8.4.23) 


Some examples of the Euler class are 
1 
(9 peas e(M a ee Fp ie 


D=4: e(M)= ——Eabca R® A R%, (8.4.24) 


sa 
3272 
where 


X(M) = [ e(M), (8.4.25) 


which are precisely the identities that we mentioned earlier in (8.2.18) and 
(8.2.19). 

Finally, we want to list some of the other characteristic polynomials that are 
of interest. If x; labels the eigenvalues of the curvature form {2, then 


Todd class = td(M) = I] , — 
Hirzebruch class = L(M) = - = —, (8.4.26) 


] . 
a roof polynomial = A(M) = I] : ™ ai 
4 sinh 4x; 


We will find, for example, that the A invariant polynomial plays a crucial role 
in calculating the anomaly for spin-+ particles coupled to gravity. 

Lastly, we have the Stiefel—Whitney classes, which, unfortunately, are not 
related to any curvature form. Instead, the Stiefel-Whitney class is important 
in deciding which manifolds are spin manifolds, i1.e., which ones can admit 
spinors. Many manifolds that at first glance seem to admit spinors actually 
have a problem. We merely quote important features of the Stiefel Whitney 
class index. The vanishing of the first Stiefel-Whitney class index means that 
the manifold is orientable: 


@, = 0 = M is orientable. (8.4.27) 


Most important, a manifold M can admit spinors if its second Stiefel -Whitney 
class index is equal to zero: 


@| = @ = 0 > M isaspin manifold. (8.4.28) 
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8.5 Dirac Index 


Now that we have listed some of the important properties of characteristic 
classes, we will show that the integrals over certain characteristic classes yield 
elegant index theorems. One of the most important will be the Dirac index, 
defined on a spin manifold. In fact, it is possible to reexpress the other invariant 
in terms of the Dirac index. If we have a spin manifold in N-space, then it is 
possible to define the Dirac equation in this space: 


MDa = Dv =0, (8.5.1) 


where I“ are the Dirac matrices (multiplied by the vierbein). Let us examine 
the operator P given in (8.3.2) and its eigenvalues: 


IDWn = AnWn- (8.5.2) 


The kernel of the operator D comprises those solutions which are annihilated 
by the operator, i.e., have zero eigenvalues. 

In even-dimensional spaces, the spinor space actually divides in half, 
depending on the eigenvalue of the operator: 


Pps Vt = Ee. (8.5.5) 


We will call these solutions positive and negative chirality solutions. In general, 
for nonzero eigenvalue, the chiralities always occur in pairs. This is because the 
eigenvalue mimics the mass term of the Dirac equation, and massive fermion 
states cannot be split into two Wey! fermions, which are always massless. 
However, for the zero eigenvalue case, the chiralities do not have to occur in 
pairs. We can have unequal numbers of positive and negative chiral states for 
zero eigenvalue fermions. 

Let us define ns to be the number of independent solutions with zero eigen- 
value of positive (negative) chirality. Let us also define the index of the Dirac 
operator to be the difference between the numbers of independent zero modes 
with positive and negative chirality: 


Index(P) = ny —n_. (8.5.4) 


It is crucial to note that this number is a topological number; i.e., it remains 
invariant under a continuous change in the manifold that does not change the 
overall topology. The Dirac index is a topological invariant because, after a 
transformation on the manifold, transitions will occur between zero eigenvalue 
and nonzero eigenvalue fermion states, but only pairs of nonzero eigenvalue 
fermions can enter the zero eigenvalue state or leave it. This is because, in 
order to become massive, chiral states must find a partner. Thus, the changes 
that occur in the index occur only in pairs, and each pair contributes zero to 
the index. Thus, the index must be a topological invariant (see Fig. 8.2). For 
example, if we have N pairs of fermions entering the zero eigenvalue state, 
then 


6 Index(P) = én, —-dn_ = N—-—N =O. (8.5.5) 


354 8. Anomalies and the Atiyah-Singer Theorem 


le 


FIGURE 8.2. Pairwise emission or absorption into the ground state. Fermions can 
enter or leave the ground state only in pairs with opposite chirality. Thus, the total 
difference between positive and negative chiral states populating the vacuum always 
remains the same. This constant is called the Dirac index. 


There are several ways in which we can express this index. Since the kernel 
of an operator is defined to be the set of states that it annihilates, we can also 
write 


Index( PD) = dim ker(P,) — dim ker(D_), (8.5.6) 


where we have split up the Dirac operator into positive and negative chiralities. 
Another way to write the index formula is 


Index(P) = ) | (n| Tp41 |n) 


n 


= Tr(Tp+1) 
=y1-yo1 
+ — 
jae fe (8.5.7) 


Notice that each positive chiral state contributes +1 to the sum. while each 

negative chiral state contributes —1, so the sum yields the Dirac index. 
Although the above formula is quite elegant, it must be carefully regularized. 

Noticed that a careless definition of the index leads to meaningless results: 


in, — Un_ = o-oo. (8.5.8) 


Because the cancellations occur level by level, it is incorrect to sum over all 
positive levels first and then subtract the sum against all negative levels. The 
sum over all positive or negative levels is infinite by itself. Only the difference 
is finite and well defined. 

The proper way to treat the index is to insert a factor that allows us to define 
the sum over all positive and negative levels separately (which is potentially di- 
vergent) as a function ofa parameter. One simple expression for the regularized 
Dirac index is 


Index( P) = Tr(T p41 er), (8.5.9) 


where f is an arbitrary positive number and D° is the square of the Dirac 
operator. Because we are taking the trace, we are free to diagonalize the Dirac 
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One squared, so we can rewrite the trace in terms of the eigenvalues 1; of 
De 


Index(D) ae > sign(i)e~ > , 
sign(i) = +. (8.5.10) 


Because the massive eigenvalues come with equal numbers of positive and 
negative chiral solutions, they cancel because they contribute with different 
sign (i) and hence do not occur in the sum. The zero eigenvalues n;+, however, 
do not necessarily come with equal numbers of positive and negative chiral 
states, and hence we have once again 


Index(P) = » sign(ije°™' 
= Unis + (-ni-)] + SofeP* — e F¥] 
ety) 


Li #0 
=n. — Ne. (oan) Np) 


In summary, fermions can enter and leave the zero eigenvalue state only in 
chiral pairs. Thus, the difference of the positive and negative chiralities are not 
affected by the presence of nonzero eigenvalue fermions (which always occur 
in pairs, and hence their contribution to the index cancels exactly). Thus, the 
fermion states with nonzero eigenvalues do not affect the index at all. 

This is not an academic question because it bears directly on the anomaly 
term. For example, we know that the chiral current can be written as 


Jt = Wa ysyu. . (8.5.12) 
Naively, this is conserved if we use the equations of motion. The covariant 
divergence (suppressing isospin indices) is equal to 
Dig ded ea An | 
= PysPb — (Dw)ysv. (8.5.13) 


Normally, this is exactly equal to zero because y“D,‘¥ = 0. We must be 
careful now to include the contribution of the vacuum expectation value, which 
in general may not vanish. Let us define 


Sr(x, y) = (Ol ¥x)h(y) 10), 
y" D,Sr(x, y) = 6(x, y), (8.5.14) 


and take the vacuum expectation value of the divergence of the current. Then 
we find 
Dice = 2 Tr(ys DS-(x, y))x=y 
Se? T1552): (Stor l5) 
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The last expression, which contains 5(x, x), makes no sense unless we regular- 
ize the integral. This is to be expected because we saw earlier that the triangle 
graph was divergent and required careful regularization. 

One standard method of regularization is called the heat kernel method, and 
it inserts a converging factor into the expression: 


D, J = lim 2 Tr{yse*?*5(a, eee (8.5.16) 


where the trace is taken over spinors, and we have regulated the expression so 
that it is finite (for positive T). 

But notice that this expression for the anomalous term in the current con- 
servation expression is precisely the value of the Dirac index. Thus, we have 
established a direct link between the Dirac index and the nonconservation of 
the axial current. 

When we evaluate this trace for an arbitrary spin manifold, we find, using 
(8.4.19) and (8.4.26), 


Index(P) = i A(M) 
M 


il 1 
=1-— ——(7p? —4 ree, 
54g?! © eae Pi — SP2) + 
where the dimension of space-time is a multiple of four. In four dimensions, 
we have 


1 1 
D=4: Index(P) = —— p; = ——— 
P)= a4?! = 34. an? 
For the case of the spinor interacting with a Yang—Mills field in a background 
metric, the index becomes an integral over the product of two factors, the A 
roof genus coming from the gravitational part and a term coming from the 
gauge part: 


[re AR). (8.5.17) 


Index( DP) = / A(M) ch(V). (8.5.18) 
M 


Notice that this formula can exist in even dimensions, not just multiples of 
four. For D = 2 we have 


Index(D) = | a= == | Tr F. (8.5.19) 
M 20 

For D = 4 we have 

dim V 

24 - 8x? 


For the case of interest, six dimensions, we have 


Index( PDP) = 


l 
[re fie | THF AF). (8.5.20) 
872 


l 
Index( P) = [mers FAF)—- _ | TR ATER R285 21) 


| 
48 - 872 
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In general, the Atiyah—Singer theorem can be written as the integral over 
the product of two curvature forms, one for the gravitational part and one for 
the gauge part. For completeness, let us list the four classical complexes and 
the various index theorems that result from them: 


Complex Index Index Theorem 

de Rahm Gauss—Bonnet _Index(d + 5) = x(M) = f e(M) 

Signature Hirzebruch t(M) = f L(M) A ch(V) (8.5.22) 
Dolbeault Riemann—Roch Index(@) = f td(M) A ch(V) 

Spin A roof genus Index(P) = f A(M) A ch(V) 


where e is the Euler characteristic, td is the Todd characteristic, ch is the Chern 
characteristic where we double the value of the curvature form, ch is the usual 
Chern characteristic e®, and 0 is the complex conjugate of 0: when we use 
complex coordinates z, Z to describe the two-dimensional surface. (We will 
discuss complex manifolds at length in Chapter 10.) 
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Now that we have erected a powerful apparatus by which we can construct 
any polynomial using the method of characteristic classes, let us investigate 
anomalies in theories where spinors are coupled to gauge and gravitational 
fields, which were calculated by Alvarez-Gaumé and Witten [12]. Let us begin 
by reviewing some elementary features of spinors. 

In any even dimension D = 2k, the Dirac matrices are complex 2*- 
dimensiona! matrices. In general, these Dirac matrices split into positive and 
negative chiral representations. In an odd number of dimensions D = 2k + 1, 
the Dirac matrices are also 2*-dimensional, and there is only one represen- 
tation. We do not have negative and positive chiralities in an odd number of 
dimensions (and hence no anomaly). Therefore, we will restrict our attention 
to an even number of dimensions. 

Let us define the matrix 


pep — oe eo. (8.6.1) 
Then 
Peep tt = 4k +2, 
Gee — ee = 4k, (8.6.2) 
Then, p+; has eigenvalue +1 if D = 4k + 2. Thus, under a combined CPT 


reversal, the chirality of a particular state does not change. Positive chiral states 
are mapped into positive ones, and the same for negative states. We find 


Wi o WV . 


8.6.3 
jo (8.6.3) 


CPT(D = 4k +2): | 
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Thus, it is possible to have unequal numbers of positive and negative chiral 
states, and hence an anomaly is possible. 

However, I'p,; has eigenvalues +i if D = 4k. Thus, under a C PT reversal, 
states with [p,, = i are transformed into states with [p,,; = —i. Positive 
chiral states are converted into negative ones, and vice versa: 


Prove, 
CPT (De 4k): (8.6.4) 
p< P+. 
If D = 4k, then positive and negative chiralities occur in pairs, and the 


gravitational interaction will not lead to anomalies. 

In summary, gravitational anomalies are possible only if D = 4k + 2. 

Now let us analyze Dirac, Weyl, or Majorana spinors. Weyl fermions can 
be defined in any even dimension. This is because the operator + 1+ TV p.1) 
can be defined in any even dimension. However, it turns out that Majorana 
spinors can exist only in 2, 3, and 4 (mod 8) dimensions. Furthermore, states 
that are simultaneously Majorana and Weyl can be defined only in 2 (mod 8) 
dimensions. We can summarize this: 


Spinor Dimension 
Dirac Any D 
Weyl D = even (8.6.5) 


Majorana D = 2, 3, 4(mod 8) 
Majorana—Weyl D = 2 (mod 8) 


Our particular case of interest is 10-dimensional Majorana—Wey] fermions. 
In 10 dimensions, three types of internal particle may contribute to the 
gravitational anomaly: 


(ay spin-4 fermions interacting with N external gravitons; 
(2) spin-+ fermions interacting with N external gravitons: and 
(3) antisymmetric fourth-rank tensor fields interacting with N gravitons. 


It is obvious why the spinors must contribute to the anomaly. Their prop- 
agators converge only as 1/p, and the usual Pauli—Villars and dimensional 
regularization methods fail because we choose chiral fields with the ys factor. 

The self-dual fourth-rank antisymmetric tensor is also added because, re- 
markably, there seems to exist no naive covariant action associated with this 
tensor. It seems to be well defined in the light cone gauge but not covariantly, 
SO we suspect that it may also have an anomaly. 

We begin with a spin-5 particle coupled to gravitation: 


S= [ a?xeeriy'D, [a -y°*)] ¥. (8.6.6) 
Expanding around flat space: 


ella = Nee a ue fees, (8.6.7) 
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We are interested in the lowest order coupling of the vierbein field with the 
spinor: 

Ly 

Ly 


eee Vout —y)y, 
ea) al — yy. (8.6.8) 


Specifically, we are interested in the single-loop diagram with a spin-+ fermion 
circulating internally coupled to external graviton legs. The first term con- 
tributes a standard three-particle vertex function with a Feynman vertex 
consisting of one momentum and combinations of polarization tensors. The 
second term, however, is a four-point Feynman graph, the sea-gull term. 

At first, it seems hopeless to calculate the single fermion loop diagram with 
an arbitrary number of graviton legs, but there are tricks we can use to reduce 
it to a rather simple problem. Specifically, using simple symmetry arguments, 
we find that we can reduce the problem to the scattering of a scalar charged 
particle intersecting with a constant electromagnetic field. Fortunately, the 
problem of charged scalars in QED is a well-studied problem with a known 
solution. Thus, the key to the problem is to reduce the complex problem to one 
that is already known. 

This trick also generalizes for the other two cases. We are also interested in 
the scattering of internal spin-3 and antisymmetric tensor fields with external 
gravitons. Symmetry arguments again allow us to reduce the problem to amuch 
simpler one. The circulating spin-5 particle can be reduced to an internal vector 


particle, and the antisymmetric tensor field can be reduced to a spin-+ particle: 


spin + and spin 2 — spin 0 and spin 1, 


[uo 


spin 5 and spin 2 — spin | and spin 1, (8.6.9) 


antisymmetric tensor and spin 2 — spin { and spin |. 


By Feynman’s rules, it is easy to construct the polygon graph. We know that 
the amplitude for the scattering of spin-5 particles can be represented as 


ge? MR(e?. ps) Z(6", p”), (8.6.10) 
where M = regulator mass and 
R(e, pD) = elit danse pide ay. pees geet, (8.6.11) 


And where the ith external graviton’s polarization can be given by 
CFU 
ny = Ey 8 - (8.6.12) 
In the above equation, notice that we have the freedom to choose the polariza- 
tion of the ith external graviton to be the product of two spin-1| polarization 
vectors. This is the key that allows us to rewrite the original scattering amplitude 
of spin-2 particles in terms of a reduced one with spin-| particles. 
Z, fortunately, has now been reduced to the scattering of a propagating 
» charged scalar particle (of charge +) interacting with a constant electromag- 
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netic field, which was computed by Schwinger decades ago [13]. Let the EM 
field be represented by 


2k+1 


Evi ) Gey — Pye, (8.6.13) 


Earlier, in (8.3.5), we used functional techniques to show that the anomaly 
term was related to the logarithm of a determinant of a propagator. We showed 
that 


P(A,) = Indet($(1 + P41) P). (8.6.14) 
Then Z is 
1 
Z = — Indet(—D, D“ + M’) 
vol 
—1 {“d "i 
a es er ae ae (8.6.15) 
vol Jo 5S 
Schwinger proved that 
1 1 eB 
— Tree * — —_—___., 8.6.16 
vol ts 4n sinheBs ( ) 


where B is the effective magnetic field of the EM field and e is the charge. (We 
will present a more general proof of this at the end of this chapter.) Although 
we cannot diagonalize the antisymmetric Maxwell tensor, we can always write 
it as 


Fy =a) See 0 (8.6.17) 


Then the anomaly contribution equals 


2k+1 


ly = —i(2r)-*"! Re, p™) I] 


. 8.6.18 
=a iS ( ) 


As expected, this is just the index of the Dirac operator. or the integral of A 

Next, we have to calculate the anomaly for a spin-5 fermion internal loop 
coupled to an arbitrary number of external gravitons. Once again, the trick is 
to use symmetry arguments to reduce the problem to a much simpler one. The 
action for a spin-3 particle coupled to gravity is 


L = —tet yyy, Di Wo. (8.6.19) 
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Notice that the spinor yy, has a vector index yz as well as a spinor index, which 
we suppress. Again, let us extract only the lowest coupling terms: 


hile 

ey 5 PW Ya dg s(1 = yyy, 
i _ 

Ly = Fe (hide hra byl? (1 — yy", (8.6.20) 
i z 

L3= 5 (8a hav =Ono)V yy ve. 

Once again, we can also write the general one-loop amplitude as 
by = PUMP RE, pPP)Z(E, p, (8.6.21) 


where Z represents the scattering of an internal charged vector meson inter- 
acting with a constant electric field. Let us write an effective theory of charged 
vector mesons that reproduces Z to one loop: 


era 
z=TrinH=- f° tre, (8.6.22) 
0 5 
where 
Hoy = —(8c + tiAc) Oy + ti Fwd’. (8.6.23) 
All determinants can be performed, and we arrive at 
. — dketl o1y  2k+l 
Isp. = i207)! RE, pM) T] — J @coshx; — 1). (8.6.24) 
gat Sinh 5%; 4x9 


The last anomaly term we wish to calculate is the contribution from an 
antisymmetric tensor field. When we calculate the anomaly associated with 
the tensor particle, however, we must be careful because there is no covariant 
action associated with this particle. Let us begin with an antisymmetric tensor 
field in 4k + 2 dimensions and its curvature: 


FeO Aue) tT CyCC perm. (8.6.25) 


If we impose the constraint that it is self-dual: 


l ae 
Fuyouser = Tyg qi emia Fanaa? (8.6.26) 
then we have problems. By the Bianchi identities, we can show that this relation 
is equivalent to the equations of motion. In other words, the naive action 


| Pe oe (8.6.27) 


Myo 
propagates both self-dual and antidual states, not just the self-dual ones. Thus, 
we have a problem. In fact, we can prove that a covariant action that propagates 
- only the self-dual states does not exist [14]. 
However, we can use a trick to calculate the scattering amplitude. Although 
, there doesn’t seem to be a covariant action, the Feynman rules for such a 
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particle can be written covariantly. The trick is to use a spinor field to mimic 
the antisymmetric tensor field. 
The energy-momentum tensor for this field is 
Tee pave 1g 2 

uv (2k)! HOOK“ 2(2k 4 ie tigew Oy +0241 * 
Fortunately, all the Feynman rules for this antisymmetric F tensor are known, 
even if its action is not known. Now we will rewrite this antisymmetric tensor 
in terms of fermion fields by a trick. Let us define 


(8.6.28) 


N 
Geo ge ee (8.6.29) 
n=0 


We can also invert this 
Faison =e eee (8.6.30) 


The advantage of using this embedding is that fermion fields are much easier 
to use than antisymmetric tensors. The two-point function is 


(Pap(4)bys(—4)) = (24°) vv" auar(Y°¥"4u)es + 9° Say5ps)- (8.6.31) 
The final amplitude can be written as 


_ SMH RECO, p)Z, (8.6.32) 


where Z represents the coupling of a charged scalar with external photons: 
Z=Trin(fiy’D, +iM) 


|e a 2 ; 
=-5 / dse™ Trexp(—s(-D, D" + iT” Fy). (8.6.33) 
0 
We know, however, that 
. 2k+1 
Tr esl uv Fuv = p2k+! I] cosh 1x7. (8.6.34) 
rit 


The final result for the anomaly contribution from the antisymmetric tensor 
field is 
2k+1 
— Len2k+l5_—-2k-1 ) Ge 
I, = Gil" 20 R(e, p®) I] ae 
j=) Sin 5Xi 


ee 
Pig 


cosh }x;. (8.6.35) 


Of course, we also have to calculate the anomaly contribution trom external 
gauge fields. The calculation there is almost identical to the one we have 
done, except now we must have n gauge particles corresponding to n gauge 
generators. 

Let us now put (8.6.18), (8.6.24) and (8.6.35) together, including the 
contribution from the mixed anomalies coming from the gauge sector. For con- 
venience, we will write the total anomaly contribution in terms of /p.>, which 
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we showed earlier in (8.4.7) was a convenient form for the anomaly because 
we could show that the anomaly term manifestly satisfied the Wess—Zumino 
consistency condition. We have 


= 
= —— Tr F° 
“ee © one 
ey * te r* + Lr Ry 
256 45 36 
(n — 496) 1 
are 


Tr F* Tr R2 


Roe ek in» (36.36) 


2.2835 4- 1080 


+ 


l 23 l : 4 I 23 
saa TRY] + Tre Trak Mi pega ky 
where R represents the curvature tensor, and F represents the Yang—Mills 
tensor. This is our final result. At first glance, it appears as if only a series of 
miracles can make such a horrible object vanish. However, a series of miracles 
indeed occurs. Specifically, we need to show that a string model is consistent 
with the following: 


(1) n = 496, which kills half the terms in the anomaly; 
(II) that the remaining nonzero terms can be regrouped into a factorized form; 
and 
(III) that this reduced, factorized form can be canceled by other terms coming 
from an effective point particle action AS. 


Remarkably, all three conditions can be imposed simultaneously. 

First, condition (1) is easy to satisfy if we set n = 496. Then a large number 
of terms vanish in (8.6.36). 

Second, condition (II) is considerably harder to satisfy, but we can show 
that the remaining terms in (8.6.36) factorize into the product of two terms if 
Tr F° can be reexpressed in terms of Tr F? Tr a anid (Tis)? ,1.¢., 


1 
Tr F° = me TF AF = (ler). (8.6.37) 


14, = 
If this strange-looking equation is satisfied, then we have the further reduction: 


ha ~ (Tr R? +k Tr F’)Xz. (8.6.38) 
The a (8.6.36) would vanish if k = — + and if 


Xg= irr -aQ(rF’y - 4,1 F oom es 
(8.6.39) 

At first, it seems remarkable that under any circumstances can we satisfy 
condition (II) by having these rigid relations hold. 

To satisfy condition (II), we must check which Lie groups are compatible 
with this strange constraint (8.6.37). Notice that the trace over the gauge ma- 
trices was defined in the adjoint representation. If we now convert this to traces 
, over the fundamental representation we will use the symbol “tr.” 
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The proof that groups exist which satisfies (8.6.37) is straightforward. First, 
we know that, in the fundamental representation of SO(/), the matrix F can be 
represented as an antisymmetric / x / matrix. In the adjoint representation, F 
can be represented as 


Fab.cd = + (FacSba — Fryebad — Fya8pe i FiqSac)- (8.6.40) 
We can insert explicit expressions into the trace calculation of F © and we find 
Tr F® = (( — 32) tr F° + 15te F? tr F*. (8.6.41) 


In order to satisfy the factorization condition, we must be able to eliminate a 
number of terms, including the sixth power of the curvature F°. Notice that 
this is possible if / = 32, which has precisely 496 generators. Thus, we are led 
back again to SO(32). 

The proof that Es ® E3 also satisfies (8.6.37) is a bit more involved. We need 
to know whether or not the Clebsch—Gordon coefficients of the group enable 
us to write independent invariants that allow us to contract four or six F’s. 
Mathematically, we need to know if there are independent Casimir operators 
of orders four and six. Fortunately, there is a theorem which says that if the 
homotopy group 72,,_;(E£s) contains the integers, then there is an independent 
Casimir operator of order n. (Homotopy is a way of generating equivalence 
classes whose members are continuous maps, rather than spaces or surfaces.) 
It can be shown that the first homotopy groups satisfying this condition are 73 
and 705: 


703(Eg) = Z, I15(Eg) = 7, 7;(Eg) = for3 <i — 1s (8.6.42) 


Thus, the only independent invariant of interest is of order two, which means 
that Tr F4 and Tr F° are not independent and can be rewritten in terms of Tr F>. 
The last step is to actually calculate the coefficient appearing in Tr F° (Tr F*)*. 
Since we are only interested in the overall coefficient, we can always calculate 
this by taking a specific representation of a subgroup within Eg, i.e., SO(16), 
and the adjoint representation 248 can be decomposed, under SO(16), into the 
sum of 120 @ 128. Thus, (8.6.37) can be proved for Fs ® Ex as well as SO(32). 
(The equation can also be shown trivially for U(1)""° and Ex @ U(1)"**.) 

We are not quite finished yet. We still have to satisfy condition (III) and 
show that there exists an effective low-energy counterterm AS that finally 
kills the anomaly. If we take naively D = 10 chiral supergravity coupled to 
496 super-Yang—Mills fields, we find that the Xx term does nor vanish. Thus, 
chiral D = 10 supergravity must be ruled out as an acceptable quantum theory. 
However, the superstring theory has new coupling terms in the zero slope limit 
that can cancel the remaining terms. 

As a low-energy approximation, superstrings can have more terms in its 
action than appear in the D = 10 chiral supergravity action. At first, this may 
seem surprising, because supergravity is the low-energy limit of superstring 
theory. However, even at the low-energy levels, there is a difference: the sum 
over an infinite number of Reggeon states will, in general, supply us with 
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more Feynman graphs than we would naively expect. (For example, the Yang— 
Mills theory has an action with up to four fields in its Lagrangian. Yet the first 
quantized string theory predicts only three-Reggeon couplings. Where is the 
missing four-particle vertex? It comes from the sum over an infinite number 
of Reggeon states, which will give us effective four-particle states in the low- 
energy limit. In general, higher trees and loops give us all the terms necessary 
to give us the Yang—Mills and gravitation theory.) 

In fact, we can show that there exists an effective tree-level term that will 
cancel the anomaly term. To show this, let us first write the 12-form as 


io ~ (it R* — tr F°) Xz, (8.6.43) 
which can be rewritten as 
In ~ (dw3, — dw3y)X¢ (8.6.44) 
because of the identities: 
Tr R? = dw3,, 
tr F* = day, 
wsy = tr(AF — iA’), (8.6.45) 


@3, = Tr(wR — 4”). 
We also have 


6W3y = —dwyy, 
Sa3, = —dwy,, (8.6.46) 


where the omega terms are Chern—Simons three-forms corresponding to either 
the Yang—Mills (Y) or the Lorentz (L) connections. We would now like to 
express the 12-form /;) that we have carefully calculated in terms of a new 
10-form w 9, so that the Wess—Zumino consistency conditions are satisfied. 
As in (8.4.10), our strategy is to find the G term by constructing w19 and 1, 
such that 


Kz = do, 


i G= 5. f on = [ dow = [ W10- (8.6.47) 
zx z M 


It turns out that, working from a 12-form down to a 10-form, there is a 
certain amount of arbitrariness in the final result. It is not hard to show that, 
starting from /;2, we can work down to an | 1-form and then to a 10-form. We 
simply quote the final result: 

O11 = $(W3_ — W3y)Xe + 2(Tr R? — tr F’)X, 
+ ad((w3, — w3y)X7), (8.6.48) 
where a is an arbitrary constant, and also 


wi = (3 +a)(Tr R? — tr F*)X6 + (5 —a)(wr — @ry)Xs, (8.6.49) 
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where we have the following expressions for X¢ and X7: 


adX7 = Xz, 
5X7 = dX¢. (8.6.50) 
Combining everything, we now have an expression for G: 
G= ++) ) [ton — w3y)dXo + (4 =a) fon — Woy \Xe. (8.6.51) 


This is the factor G that we wanted to calculate. Now the question is whether 
or not the D = 10 supergravity coupled to the super-Yang—Mills theory can 
contribute an anomaly term that cancels G. It is not hard, given all these 
identities, to show that the addition of the following effective action will give 
a new contribution to the anomaly term that will precisely cancel G: 


AS ~ i BXs — (§ +a) i (w3,, — W3y)X7, (8.6.52) 


where @ is a constant, the w3; terms are Chern—Simons terms, w is the spin 
connection, and B is the two-form By, that appears in the coupling of the 
supersymmetric Yang—Mills theory with D = 10 supergravity (the Chaplain— 
Manton action). A careful comparison shows that they yield an anomaly-free 
theory if we choose the variation of the B field as 


$B =a), — 2h, (8.6.53) 


It is important to notice that this is not the usual variation of the B field found in 
supergravity theory. Thus, the Chaplain—Manton supergravity action is riddled 
with anomalies. This is discouraging, until we realize that in superstring theory 
we have effective terms arising from loops and the summation over an infinite 
number of resonance states. Thus, it is possible to have this variation of the 
B field and still have an acceptable supergravity theory. The next problem, 
however, is to explicitly show that this cancellation takes place as promised. It 
is one thing to show that the B field might have the correct variation such that all 
anomalies cancel. It is another thing to show that it actually happens that way. 

Surprisingly, the cancellation of anomalies for the entire superstring turns 
out to be much simpler than the cancellation of anomalies for the point particle 
supergravity system! 


8.7. Anomaly Cancellation in Strings 


We saw that a series of miracles happens to cancel all anomalies if we have 
n = 496 and the gauge group is equal to SO(32) or Ex © Ex. These arguments, 
however, were formulated only in the low-energy approximation and must be 
redone for the general case. Surprisingly, because the string model treats the 
tower of resonances in a compact fashion, the calculation for the superstring 
is much easier than the calculation for the supergravity theory [15]. 
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Let us begin with the Pauli—Villars regularization method, replacing the 
usual propagator with a massive one: 
1 1 Fo +im 
—_—- = ——.. 
Fo Fo—im  Lo+m? 
Thus, we adopt the philosophy of ordinary field theory, that the anomaly occurs 
because the Pauli—Villars regularization procedure (or any other regulariza- 
tion program) violates chiral invariance. Here, the addition of the mass term 
explicitly breaks chiral invariance. 
Before regularization, the single-loop hexagon diagram that violates parity 
looks like 


(8.7.1) 


70) = f apt (Fv. Pv). (ea) 
ily Lo 
where 


and where t(1 +111) is the projection operator that selects out the “even G 
parity” states of the NS—R model: 


i eee 
Dg = (—1)hea onde (8.7.4) 


(Without the GSO projection, there is actually no anomaly, because the theory 
is then parity-conserving.) 
The amplitude that we want to calculate is 


lim [T(0) ~ T(m)]. - (8.7.5) 


In the limit of large m, the dependence on the mass should drop out and 
leave a finite result. Let us now perform all the traces over the Dirac matrices. 
Explicitly, the amplitude that is left is 


1 ] 
eee 10 Oa ors). 8.7.6 
G~ im e(e.k) fd ptr(> m2 Voll) meee o(6)'y }, (8.7.6) 


where 
ECC TR oe eee emer cs hy an Re. (8.7.7) 


Special care must be exercised in taking the limit as the mass goes to infinity. 
In general, we must add together the contributions from the planar and the 
nonorientable single-loop graphs. We find 


1 5 
GE ele. wf ] [avie@n — v;)(0| Vo(ki, z1)++- Volks. 26) 10) , 
OR = 
: (8.7.8) 


2 5 
Giio ~ EG, i | ] [@vie@in — v;) (O| Vo(ki, 21)+ +> Volke, 25) 10) . 
0 j=) 
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Putting everything together, including the isospin factors coming from the 
Chan—Paton factors, we have 


foes 
G ~ (n + 321) Tr(Ay An ++ Age(C, k) / [ [@»: 
0 j=1 


x O(vi41 — V;) (O| Vo(ki, 21) --* Volks, Z6) 10) . (8.7.9) 


To have a vanishing anomaly term, we need only show that n + 321 is equal to 
zero. It is important to analyze where this n + 32/ factor comes from: 


(1) n comes from the fact that we have to trace over both the inner and 
outer edge of a disk with a hole. The outer trace gives the Chan—Paton 
factors; the inner trace (which has no external lines) gives the factor 


Techy — Te 
(2) The factor / comes from the isospin group and equals 
1 Usp(n), 
l= 0 U(n), (8.7.10) 
—1 SOf(m). 


(3) Most important, the factor 32 comes from several sources. There was a 
factor of 32 in the Jacobian because the nonorientable graph had an in- 
tegration from 0 to i, while the planar graph had an integration from 0 
to 1. Furthermore, the integration region was only 4 as big. And finally, 
there was another factor of 32 coming from the 32 ways in which an odd 
number of twists can be placed on six internal propagators. Thus, to have 
a cancellation, we must fix 


| = —1 — gauge group is SO(n), 
n + 321 = 0 — gauge group is SO(32). (8.7210) 


In summary, we have an anomaly-free theory with Chan—Paton factors if 
we choose SO(32) as our gauge group. 


8.8 Summary 


We have used Chan-Paton factors and the anomaly cancellation to fix the gauge 
group of string theory. First, the Chan-Paton factor is simply the trace over 
various group generators that multiplies the Veneziano—Born term: 


T1,2, 35..., N= Yo Tr(A1A2A3 -+ Ag Atlee 3, 2.2, NV). 8.19 


perm 


Unfortunately, the only restriction that unitarity places on the group is that it 
must be Usp(N), SO(N), or U(N), for arbitrary N. 
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Much more stringent restrictions on the group are found when we demand 
that the model be anomaly-free. In general, an anomaly occurs whenever a clas- 
sical symmetry of the Lagrangian does not survive the quantization process. 
The chiral anomaly arises, for example, because the method of regulariza- 
tion (e.g., either Pauli—Villars or dimensional regularization) must necessarily 
violate chiral invariance. 

In particular, the divergence of the axial current, instead of being zero, is 
now equal to 


1 
O,J¢? = rere Fav Pop, (8.8.2) 


which is a total derivative or a topological term, given by 


J = —8r7 HP” Tr(AgdgAy + 3 AgApAy). (8.8.3) 


In general, the anomaly in a higher dimension will be proportional to a 
topological term. For example, using the theory of forms, we can show that 
the product of the curvature tensor can be written as the divergence of another 
form. For example, we can show that the trace over curvatures can be written 
in terms of the divergence of a Chern—Simons form: 


1 
Tr Q” =nd i dit!" Tr{A(dA + tA*y""}. (8.8.4) 
0 


The study of these invariant polynomials takes us to the theory of charac- 
teristic classes. There are four classical characteristic classes. The Chern class 
can be defined as 


c(Q) = det (1 ao 2) =14+¢(Q)+o(Q)4+---. (8.8.5) 
The Pontrjagin class is defined for O(k) as 


1 
p(a) = det (1-2) =1+ rt pte. (8.8.6) 


The Euler class is defined in terms of the Pfaffian: 
a” =r! (2m)'e(a) dx! Adx? A--» AN dx”. (8.8.7) 


Lastly, there is the Stiefel—Whitney class, which cannot be written in terms 
of curvature forms. However, we will find the Steifel-Whitney class to be 
important when we are analyzing spin manifolds. In particular, the vanishing 
of the first two indices yields an orientable spin manifold on which we can 
define spinors: 


@, = 0 < M is orientable, 
@| = ®) = O > M isaspin manifold. (8.8.8) 
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The index theorems, in turn, are usually written in terms of the following 
invariant polynomials: 


xj 
Todd class = td(M) = I] Tee 
Xj 
Hirzebruch class = L(M) = | | ae (8.8.9) 


! 
A ME 
a roof polynomial = A(M) = | | —~—. 
pon I] sinh 4; 
The most useful of the index theorems is the Dirac index theorem, which 
concerns the number of positive chirality zero eigenvalue solutions to the Dirac 
equation minus the negative chirality solutions. Using the supersymmetric 
sigma model, we can show the following: 


Index(P) = Tr(—1)e*” 


1 
= 5 f a?x3.0"° 


ee : a1 
= (=) [tee aee"'” (4) sinh :R| (8.8.10) 
. 2 2 


Armed with this theoretical apparatus, we can calculate the gauge and grav- 
itational anomalies found in supergravity and in string theory. In general. the 
gravitational and gauge anomaly contributions arise because the internal line 
can be either: 


(1) achiral spin-+ fermion; 
(2) achiral spin-3 fermion; or 
(3) an antisymmetric tensor that has no covariant action. 


The calculation of the anomaly contribution is carried out explicitly with 
Feynman diagrams. The calculation, however, simplifies enormously because 
we can make certain assumptions about the polarization tensor of the external 
lines, thus reducing its spin. Thus, the difficult problem of contracting over 
the various indices is reduced to a much simpler problem of contracting over 
a lower-spin particle. The final results are 


2k+1 Lt A 

Ih = —i2**' R(el), p')(4rr y+! a? |= 
iI 

hel 1. | (RG 


fay = iAP R(EM. p'”) TT] ae 2 — 1). (8.8.11) 
j= Sinh 5x i 


roe 5X; 


k+l 1 
=. | 
Ig = 4124120! RE, p) I] ee cosh 5X 
xX; 
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Putting everything together, we find the total anomaly contribution from 
both gauge and gravitational sectors: 


i 1 
lec = — as F°. a 4 2 
12 = 759 Ly ore ee 
1 1 
genes f F2 ey R4 2\2 
256 @ ‘ + sgt RY) 


| I 


] 
6 2 4 
6A R° + Tr R° Tr R 


29835) 4- 1080 


= 


(Tr R 7) (8.8212) 


1 1 1 
Tr R?)> | + — Tr R? Tr R4 
g- 1206 ®) + seg eae 
Miraculously, we can set this to zero if we make a few assumptions. First, we 
must set 7 to be 496. Then we assume that we can factorize the anomaly into 
the product of two terms: 


Io = GR? +k Tr F) Xz, 


i 1 1 
Xs = — Tr F* — Tr F?)? — —— Tr F? : 8. 
8= 5,10 7500! i i 540 Tekh Ire (8.8.13) 
l 1 
— Tr R* + —(Tr R*)’. 
=F 3 1 ots 30! ae) 
Notice that D = 10 supergravity is immediately ruled out because the 


above identity cannot be satisfied. However, string theory has one big ad- 
vantage over supergravity. The presence of higher spin fields in superstring 
theory means that the zero slope limit of the theory does not have to re- 
duce exactly to supergravity. In particular, the couplings of the B field in 
superstring theory are such that they can, in principle, cancel the terms shown 
above. 

The actual proof, however, that the anomaly term disappears must be carried 
out with the full single-loop hexagon graph in the string formalism. We use a 
Pauli—Villars-type regularization on the intermediate lines and then sum over 
the planar and the nonorientable loops. The anomaly is proportional to 


1 ] 
~ im?e(t,k) | d'°pTr {| ———~ Vo(1):-- ——— Vi ara). 
G ~ ime(s, | p ‘( (1) 5 MOCO y 
(8.8.14) 
where 
&(€, k) == eer il pea Coke aie “is (8.8.15) 
-and 


Py = (-Dea em, (8.8.16) 
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Finally, adding the planar and nonorientable diagrams together yields 


i 8 
G ~ (n +321) Tr(AyAg- + Ag e(E, o | [ [au 
0 j=1 


x O(v;41 — V4) (O| Voki, Z1)-++ Volk, 26) 0). (8.8.17) 
For this term to be zero, we must have n = 32 and 


1 Usp(n), 
l= 0 U(n), (8.8.18) 
—1 SO(n). 


Thus, the gauge group must be O(32). 
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CHAPTER 9 


Heterotic Strings and 
Compactification 


9.1 Compactification 


The most pressing problem facing string theory is the question of breaking a 
26- or 10-dimensional theory down to a realistic four-dimensional theory. Until 
such a dimensional reduction can be made, the theory lacks any real contact 
with physically measurable quantities. 

Until dimensional breaking can be done within the framework of field theory, 
however, the best we can do is look at classical solutions where spontaneous 
compactification of the extra dimensions has already taken place. In this chapter 
we will explore the heterotic string, which incorporates the groups Eg ® Ex 
and Spin(32)/Z, that arise when we compactify a 26-dimensional space down 
to 10 dimensions. 

As we saw in the previous chapter, the cancellation of anomalies allows for 
the possibility of either O(32) or Eg ® Eg. We saw, however, that the Chan— 
Paton method does not allow for the possibility of exceptional groups. Thus, 
we must use yet another method, arising from compactification on a self-dual 
lattice, to achieve an Eg ® Eg model. Although the heterotic string 1s a closed 
string theory, it contains within it the super-Yang—Mills field, which usually 
emerges in the open string sector for Type I strings. 

The heterotic string takes advaniage of the deceptively simple identity 


26 = 10 + 16. (Oa) 


This means that a 26-dimensional string, when compactified down to a 10- 

dimensional string, leaves 16 extra dimensions that may be placed on the root 

lattice of Eg @ Eg, which is known to yield an anomaly-free theory. This 
» observation was made by Freund [1]. 
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The heterotic string takes advantage of the fact that the closed string has 
two independently moving sectors, the right-moving and left-moving sectors, 
depending on whether they propagate as functions o + t or o —T. The heterotic 
string makes fundamental use of this splitting. The word “heterosis” implies 
“hybrid vigor.” This means that the asymmetric treatment of the left and right 
movers creates a hybrid theory considerably more sophisticated than the earlier 
Type I and II superstrings. It has been shown that it is has no tachyons, no 
anomalies, and is finite to one loop. 

Before we begin a discussion of the heterotic string, however, let us first 
describe the process of compactification by describing the simplest possible 
case, that of a scalar particle in a periodic one-dimensional space. This means 
that we make the identification 


x=x+2nR, (91:2) 


where R is the radius of this space, which is just the real axis divided out by a 
one-dimensional lattice T' of length 27 R: 


R 
S, = a (9.1.3) 
A field defined in this periodic space must therefore satisfy 
p(x) = o(x + 27 R) (9.1.4) 
which means that it can be power expanded in periodic eigenfunctions: 
E(y—= > Gre (9.1.5) 
where 
=~ 9.1.6 
p a R ’ ( ei. ) 


where n is an arbitrary integer. Thus, we see that the momentum conjugate 
to x becomes quantized in terms of integers. This is a feature common to all 
compactifications. 

Now let us generalize this to a particle living in five-dimensional space-time 
such that the fifth coordinates has curled up and become periodic. Consider a 
scalar field that satisfies the massless Klein—Gordon equation: 


Os@(x,, x5) = 0. (9.1.7) 


As before, we can power expand the scalar field in terms of periodic 
eigenfunctions: 


GG, 2 )= > gee”, (9.1.8) 


where ps = n/R. Notice that each eigenfunction can change the effective 
“mass” of the Klein—Gordon operator: 
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Several conclusions can be drawn from these simple examples: 


(1) The compactification of a dimension creates a quantization of the mo- 
mentum corresponding to the compactified coordinate. The momentum is 
labeled by integers. 

(2) The mass spectrum in the space-time dimensions that are not compacti- 
fied is shifted by an effective “mass” term coming from the compactified 
dimensions. 

(3) The radii of the various compactified dimensions can be totally arbitrary. 
There is considerable freedom in choosing the lattice on which we will 
compactify the space. 

(4) Wecan power expand the wave function in terms of periodic eigenfunctions 
of the compactified coordinate. In one dimension it is simply sines or 
cosines. In higher dimensions we may have spherical harmonics. 


Now let us consider the case of compactifying theories of higher spin, 
such as general relativity, which will generate a Maxwell field out of the fifth 
dimension. 

Historically, the idea of compactification first came from Kaluza [2-4], who 
wrote a letter to Einstein in 1919 proposing the idea of combining Maxwell’s 
electromagnetic theory with Einstein’s general relativity by expanding space— 
time to five dimensions. Kaluza proposed to write the metric tensor as 


Bar = ( aa ) (9.1.10) 


where we define the five-dimensional metric in terms of the Maxwell field A,, 
and the four-dimensional metric tensor g,,,: 
2511 = Bus a KAy, 


Pav = Suv k ApAy. (Oot I) 


For the moment, let us assume that the fifth dimension is not experimentally 
measurable because it has curled up into a very small circle. Thus, the fifth 
dimension is periodic: 


Xe = Ag 2 (1.12) 


That is, by traveling a distance 27 R in the fifth dimension, we arrive at the 
same starting point. The original five-dimensional Riemannian manifold is 
now assumed to have split up according to 


Rs = R4 x tk (9:1, 13) 


Now let us assume that the radius of the fifth dimension is so small that it 
cannot be measured. Thus, we can set 


ae a0. (9.1.14) 
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With this added assumption, the equations simplify enormously. The variation 
of the metric, for example, is usually given by 


b05y = 0,Ny + epee ae (9.1.15) 
When the circumference of the fifth dimension is small, we have 
685, = O,Ast+--:. (9.1.16) 


Written in terms of (9.1.11), this reduces to 
BAG ie Oy Nis) (9.1.17) 


which is precisely the U(1) gauge variation of the Maxwell field. Since there 
is only one action that has this U(1) symmetry and has only two derivatives, 
we thus conclude that Einstein’s theory, written in five dimensions, reduces 
to Maxwell’s theory coupled to a four-dimensional gravitational theory. For 
example, we can explicitly calculate some of the Christoffel symbols in this 
approximation: 


Psyv = $K(O,Ay — ByAy) + +++ = 3K Fyy. (9.1.18) 


We can thus explicitly reduce out the five-dimensional Einstein theory in this 
approximation, and we find 


= sav eRe” — iB FP +e. (9.1.19) 


Thus, a genuine Maxwell field emerges in four-space when we compactify a 
five-space theory of gravity. 

Similarly, we can generalize this result to higher manifolds, such that an 
N-dimensional manifold peels off a smaller compactified manifold Kp of 
dimension P: 


Ry aa Ry_p x Kp. (9.1.20) 


When this is done, we can reanalyze the gauge group that results from 
this breakdown, and we can derive the orthogonal groups or the SU(N) 
groups. In this way, we can show that Yang-Mills theory emerges out of 
higher-dimensional gravitation theory. 

Although this Kaluza—Klein formalism elegantly unites both the Yang-Mills 
and gravitational theories in one framework, there is a severe drawback to this 
approach that dates back to Kaluza’s original proposal, namely, Why did the 
fifth dimension suddenly curl up into a tiny ball? Klein's suggestion that it 
curled up quantum mechanically into a ball of radius equal to the Planck 
length was an important one, but it did not answer the difficult problem of how 
dimensional breaking actually occurs. We are still facing this problem, first 
raised over 70 years ago, in superstring theory. 
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Next, we wish to discuss the question of compactification in the framework 
of string theory [5], which must undergo the reduction from 26 or 10 down to 
four dimensions. Let us first study the compactification of the ith coordinate 
of an open string: 


i : é lige : 
X'(o, tT) = x' + 2a0'p't + as on cosnoe '”’, (9.1.21) 
n#~0 


As before, the periodicity of the ith coordinate enforces the condition that 
momentum is quantized according to integer multiples of the integer M; 


p= —, (9.1522) 


where R; is the radius of the ith compactified coordinate. This is exactly as 
before. In general, the radii of the compactified dimensions do not have to 
be the same. As before, we also find that the mass spectrum of the theory 
is shifted by the presence of the compactified dimensions. By analyzing the 
Hamiltonian, we see that the masses are given by 


am? = — )° MP +N, (9.1.23) 


ve 3 Seana | (9.1.24) 


Thus, the mass spectrum is shifted by the square of an integer, as in 
(9. 199). 

For the closed string, however, we have an additional contribution to the 
mass. There is the extra complication that the closed string can loop N; times 
completely around the ith dimension, much as a rubber band is wound around 
a cylindrical tube. It is important to notice that this configuration, which con- 
tributes a new term to the Hamiltonian, has no counterpart in point particle 
compactification. We thus have, for the closed string, 


X' = x' +2a'p't + 2N;Ro 
ee To acy ee 
Ge (ape + ale ME) O25) 
Saale 


We now have two integers M; and N;, which represent the effect of compacti- 

fication for the closed string. The integer N, in fact, describes the soliton state 

of a string wound around the compactified dimension an integer number of 
‘ times. Notice that this soliton state is stable because of topology. 


378 9. Heterotic Strings and Compactification 


The shifted masses are now given by 


10—D 


1 a 
to'm? = 5 Yo (a? M?/R? + R°N?/a?)+NH4N, 
i=l 


where N and N are energy operators for the two different sectors of the closed 
string. 

In this discussion, we have compactified the 10 — D-dimensional space on 
a torus. However, this is phenomenologically undesirable, because it leaves 
us with N = 4 supersymmetry and without any chiral fermions. It is more 
desirable to compactify on more sophisticated surfaces, such as the lattice of 
a Lie group (see Appendix). We now turn to the heterotic string, where we 
compactify on the lattice of Eg @ Eg, or Spin(32)/Zp. 


9.2 The Heterotic String 


Let us now begin a discussion of the heterotic string of Gross, Harvey, Martinec, 
and Rohm [6]. We saw in the last chapter that the anomaly cancellation required 
either O(32) or Ex @ Eg. However, we also saw that Chan—Paton factors are not 
compatible with exceptional groups. Thus, we are forced to use the mechanism 
of compactification to generate the isospin group E; ® Eg. 

The heterotic string is a closed string with the unusual feature that it treats the 
compactification of the left- and right-moving sectors separately. In the left- 
moving sector, which is purely bosonic, let us begin with a 26-dimensional 
string where 16 of the dimensions have been compactified on a lattice. (The 
16-dimensional lattice we will eventually choose will be the lattice of Ex @ Eg 
or Spin(32)/Z>.) 

The right movers, on the other hand, are supersymmetric; i.e., they con- 
tain the GS Majorana—Weyl fermionic field S“(t — 0), where the index a 
runs from | to 8 (in the light cone gauge), and also the spacetime string 
field X': 


Left movers Right movers 


X'(t +o) X'(t —o) 
X'(r +0) S4(t —o) 


where /, which labels the directions of the 16-dimensional lattice, runs from 
] to 16, and, which is the space-time index, runs from | to 10 — 2 = 8 in the 
light cone gauge. 

Notice that the spacetime X' appears in both in the left- and right-moving 
sectors. Putting everything together, we now have the light cone action for the 
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heterotic string: 


16 
= far fe do ( i OSX! pes + iSy~(0, + 0,)S 


(9.2.1) 
In this action, we have to enforce the constraints that place the various fields 
in their proper sector: 
Ca On )X! = 0, 
y7S=3(1+ nS =0, (9.2.2) 
| 6 9 
y Wend yo 
This action is explicitly supersymmetric under the variation 
5X! = + —1/25 iS 
Mee (9.2.3) 
6S" =i(p") “y-vy(0; — 0,)X"e. 


Let us now write an explicit breakdown of the theory in terms of normal 
modes (notice that we have slightly changed the normalization of the spinors 
from that of.(3.8.8)): 


j a ee 
G4 63 = a) = 5x! ab tpi(r = a) ats Si ~~ pede ake 
=! 
; ae Sey 
X'(c +0) = px) + ppc +o) + Fi Dente), 
n=] 
ee) : 
(ey Se, (9.2.4) 
é nn ae 
XG +o) = x! ae pc at a) aie ti s lies 
n=} 


Similarly, we can read off the canonical commutation relations of the theory 
directly from the action. From these, we can, in turn, write the canonical 
commutation relations between the various oscillators: 


oe 18 
(ee lao —nogeo, 
[q,, Gn) = 0, (9.2.5) 


(ae b= 1b, nd 


(Si, Sh} = (VTH) Sam 


n? 
‘where h is the chirality projection operator h = $(1 + yi1). 
(There is one commutator, however, that is rather subtle. Notice that we have 
’ constrained the compactified degrees of freedom to be left-moving. Thus, the 
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quantization of these modes must take into account this constraint, or else there 
will be an inconsistency within the theory. Fortunately, the only modification 
made by this constraint is to multiply the canonical commutation relation by 
half: 


[a p s08"?. (9.2.6) 


This can be explicitly checked by using the Dirac bracket formulation of gauge 
constraints or by checking that the newly defined commutator is consistent with 
the constraints.) 

Let us define the number operator for each sector: 


N=) (i,o4 + 3nS_ny7 Sn), 


n=] 
~ eS . . 
N=) Ga ae) (9.2.7) 
n—I 
This means that we can categorize the oscillators in both sectors as follows: 


Left movers Right movers 


~ 


a a’ 9.2.8 
ae sa 2) 
N N 


With these definitions, we find, as before, that the mass can be written as 
- 1 <6 , 
im? = N +(N — Der DP (9.2.9) 
I=1 


In addition, there is one extra constraint coming from the fact that we have 
a closed string: the theory should be independent of the origin in o space, as 
in (2.8.7). Thus, if U(@) is the operator that rotates a closed string by @ in the 
o parameter, then we must satisfy the following for physical states: 


U(@) |phy) = |phy) . (9.2.10) 
For the heterotic string, the rotation operator is 


: 1 ; 
2i0 (Hei 5 30h") (92.11) 


— ffs) 


U(@) =iexp 


This operator, acting on an arbitrary function of 0, produces the transformation 
U(@)F(o)U'(6) = F(o + 8). (9.2.12) 


To ensure that U(@) = | on physical states, we must satisfy the following 
constraint: 
16 


: 1 
ae le Ga. (9.2.13) 
fea 
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Up to now, we have not endowed the 16-dimensional lattice with any 
structure. To fix the theory, we take the space to be 
- R'6 
oe 
where A represents an as yet unspecified lattice in 16 dimensions. 
Let us span this 16-dimensional lattice by basis vectors: 
en (9.2.15) 


To say that we will compactify the 16-dimensional space with a lattice simply 
means that if we walk in the direction L’ that is specified by one of the lattice 
vectors, we eventually wind up at the same spot. With this basis, we want the 
center of mass string coordinate to be periodic along any of these basis vectors: 


Ga) CE la bu 


1 16 
L' = —) n¢e!R,, (9.2.16) 
/2 dX, 


loa (9.2.14) 


where the n; are integers and R; are the radii of the various compactified 
dimensions. Thus, if we wander off in any of the directions labeled by the 
basis vector, we wind up back at the original position. This places a nontrivial 
constraint on the momenta. We know from ordinary quantum mechanics that 
2p! is the generator of translations in the / th direction. Thus, periodicity means 
the following displacement operator, acting on states, must have an eigenvalue 
equal to one: 


ef2nprL! — |, (9.2.17) 


This operator simply generates the displacement in (9.2.16) and makes a trans- 
lation completely around the torus in the /th direction, until we wind up at the 
original spot. 

Thus, the canonical momenta must be defined as 


16 
pi=V2) mej /R:, (9.2.18) 
—) 


where m is an integer and the e* define the dual lattice A* such that 
D 
ee (9.2.19) 
fill 


At this point we should mention that the lattice over which we compactify 
the theory is not totally arbitrary. First of all, it must be an even /attice because 
of the constraint (9.2.13) due to demanding o independence of the Hilbert 
space. Notice that the U(@) operator in (9.2.11) can be set equal to one on the 
Fock space only if $(p’)? = integer, i.e., 


(p') = even integer. (9.2.20) 
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Therefore the winding numbers L’ must lie on an integer, even lattice. 
The metric tensor for the lattice corresponding to a Lie group is defined as 


16 
Sip yi efel, (9.2.21) 
{=i 


we find therefore that the metric must be integer-valued and g;; is even. Second, 
the lattice must be self-dual (i.e., the lattice vectors are equal to the vectors 
on the dual lattice) because of modular invariance, which will become more 
apparent as we discuss the first loop diagram in Section 9.6. There are very few 
even, self-dual lattices. They exist only in 8n dimensions. In eight dimensions, 
there is only one, Is, the root lattice of Eg. In 16 dimensions, there are only 
two of them: 


I; x Ts, lates (9.2.22) 


which fixes the lattice to be either Eg ® Ex or Spin(32)/Z2. Thus, we actually 
have very little choice in the selection of the gauge group. 
The metric for Eg 1s as follows: 


es 
Se 
=) 2a) 
a Eas (9.2.23) 
ee ae | =i 
= | 
=) 
Si 2 


There are several ways in which to describe the root lattice for Ey. One of the 
simplest is as follows. We have 84 root vectors given by 


aE; Stes Psi = 7, (9.2.24) 
and 128 root vectors given by 
+(te; te, te; + --- + eg) (9.2.25) 
and 28 vectors given by 
+e; eg. (9.2.26) 


Including the eight members of the Cartan subalgebra, that gives us a total of 
248 generators, which form the adjoint representation of the group. 

Later in this chapter we will also use the concept of simply laced groups, 
i.c., groups that have roots of the same length. The simply laced groups are of 
the type A, D, and E. 
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. Let us briefly summarize the kinds of Lie groups we will encounter in our 
discussion of the heterotic string: 


Lattice Roots 
Even (p'y = even; (g;; = even) 
Self-dual ne AS 7(detey) — 1) 


Simply laced Same length; (A, D, E) 


These lattices are important because the U(@) = 1 condition forces us to 
have even lattices, and modular invariance (as we shall see in Section 9.6) 
forces us to have self-dual lattices: 


U(@) = 1 — Even lattice, 
Modular invariance — Self-dual lattice. 


In 16 dimensions, the only even, self-dual lattices are the root lattices for 
Eg @ Ex and Spin(32)/Z. 


2 oe opecirum 


Let us now investigate the spectrum of the theory. First, we note that the 
right movers within the theory are explicitly space-time supersymmetric. The 
generator of space-time supersymmetry can easily be constructed from the 
variation of the fields. This, as we saw in earlier chapters, allows us to construct 
the current associated with the symmetry. We found earlier in (3.8.22) that the 
light cone supersymmetric operators can be written as 


O° = i/p*(y* So)" + 2i(pty'? > (4S_n)*au, (9.3.1) 


which acts on the right movers. We can easily show, given the canonical 
commutation relations, that 


(Q°, Q”} = —2(hy" Py)”. (9.3.2) 


To analyze the left-moving sector, we notice that isospin is introduced in an 
entirely different fashion than the Chan—Paton factors. Contrary to the usual 
Chan—Paton factors, which simply multiply the amplitudes by the trace of a 
series of generators, the isospin factors emerging from the compactification 
process astronomically expand the number of isotopically allowed states. No- 
tice that the index / is placed on the harmonic oscillator itself, which greatly 
proliferates the number of allowed states of the Fock space. 

In general, since the two sectors don’t really communicate with each other, 
a member of the Fock space of the heterotic string will be the product of the 
left- and right-moving sectors: 


|0)z x |0),. (9.3.3) 
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At the lowest level, let us first investigate the right-moving sector. As before, 
in (3.8.13) and (3.8.14), the eight states in the fermionic |a) or bosonic |i) 
ground state of the superstring can be written in terms of each other: 


i 
la)r= g (viSo)” lidr 


lie) = <a1S0l7, Y41F lade (9.3.4) 


Thus, we have a total of 8+ 8 = 16 states in the right-moving massless sector. 
Now let us investigate the left-moving sector. We have 


a! , 10), — 8 states, 
&' , 10), —> 16 states, (9.3.5) 
[p's (p'y = 2) —> 480 states. 


(It is very important to realize that this particular representation of the spec- 
trum breaks manifest Ex; @ Eg symmetry. To see this, notice that there are 16 a’ 
operators, which correspond to the dimension of the root lattice of the group, 
but these 16 operators do not form a representation of the group. Similarly, the 
vectors |p’: (ply = 2) correspond to the 480 roots of length two. However, 
this also does not form a representation of the group. Only when we add 16 
and 480 and form 496 states do we finally construct the adjoint representation 
of the group. At the massless level, it is still possible to recombine the broken 
states into Ex @ Ex multiplets, but it becomes increasingly prohibitive at higher 
mass levels. Later, we will give an argument, based on Kac—Moody algebras. 
demonstrating how we can show that the entire spectrum is, in fact, Es ® Ex 
symmetric.) 

At the massless level, we obtain the spectrum by multiplying the right and 
left movers. The total number of states in the lowest level of the heterotic string 
is thus the product of the left and right movers: 16 x 504 = 8064 states. The 
breakdown of these states is now given as follows. Of the 8064 states, we find 
that 128 belong to the N = 1, D = 10 supergravity (in the light cone): 


supergravity > a@'_,|0),; x li ora)p. (9.3.6) 
If we break this down into distinct states, we find 
graviton > a@'_,|0), x |/)p + (ie J), 
antisym. tensor > &@_, 0), x |j)p —(i © J), (9.3.7) 
gravitino > a@'_, |0); x la)pr. 


Other states belong to the super- Yang -Mills theory defined on Ey @ Ey. For 
example, the super-Yang-Mills multiplet (A“. y“) is in the adjoint represen- 
tation of the group, which has 496 states. These 496 isotopic states are given 
by the 480 + 16 states contained in the left-moving sector: 


Gi! 10) pe | pp’; (p')? = 2), . (9.3.8) 
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Thus, the super-Yang—Mills state multiplet can be represented as 496 x 8 states: 
super-Yang—Mills — [a , |0), +|p’),] x li ora),. (9.3.9) 


In summary, we have exactly 10-dimensional supergravity and Yang—Mills 
fields at m? = 0. 

Notice that we had to combine 480+ 16 to yield an irreducible representation 
of the group. This means that, in general, the Fock space of the heterotic string 
does not manifestly recombine into Eg @ Ex multiplets. At the next level, for 
example, the states form irreducible representations of the algebra only in the 
last steps of the calculation. 

At the next level, where m? = 8, for the right-moving sector we have 128 
bosons and 128 fermions: 


128 bosons 
Su 18)R 


a! la)p, 


S10 
ein li)p- : 


128 fermions 

Notice that we have formed bosons out of the tensor product of two fermions 
in the above equation. 

The left-moving sector is considerably more involved, with a total of 73,764 


states. We have a total, therefore, of 256 x 73,764 = 18,883,584 states! The 
scalar states are 


|p’, (p'y = 4), > 61,920 states, 


x! lps (p’y = —> 7680 states, 
ee °- 1 Vaal h | (9.3.11) 
a_,a_,|0); — 136 states, 
&!, |0), — 16 states, 


for a total of 69,752 scalar states. (To count the states in |p’, (p!)’ = 4), we 
used the fact that the number of points of length squared equal to 2m for integer 
m on the lattice I)5 is 480 times the sum of the seventh powers of the divisors 
of m. Thus, for this states, there are 480(1 + 2’) = 61,920 states [6].) 

At first, it is not obvious at all that these 69,752 scalar states can be rearranged 
in terms of Eg ® Eg multiplets. However, a careful analysis shows that they 
can be broken down into the following Eg ® Es multiplets: 


(3875, 1) + (1, 3875) + (248, 248) + (248, 1) + (1, 248) + (1, 1) + (1, 1). 


(9.3.12) 
The 3976 vectors in the left-moving sector are arranged as 
a, |p’, (p'y =2), — 3840 states, 
vectors > a'_, |0), — 8 states, (9.3.13) 


a_,a!, |0), — 128 states, 
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and we have the tensor state 
a_,@/ , |0), —> 36 states. (9.3.14) 


The total number of states in the left-moving sector is thus 73,764. 
At higher levels, the degeneracy climbs exponentially as 


d(M) ~ e2+v2inva'M | (9.3.15) 


It was not obvious that these states can be reformulated in terms of 
Es ® Eg multiplets. At higher levels, it seems like a hopeless task to show 
that higher masses all can be rearranged into irreducible representations 
of the group. What we need, of course, is a higher symmetry to prove 
this to all orders. We will do this when we apply Kac—Moody algebras in 
Section 9.7. 


9.4 Covariant and Fermionic Formulations 


Although we have analyzed the heterotic string’s spectrum in the light cone 
gauge, we can also write down an explicitly covariant version of the first 
quantized action. We begin by writing the right-moving supersymmetric sec- 
tor in covariant form. Let P,, and Q® represent two of the generators of the 
super-Poincaré group. Let us generalize the operator that gives us transla- 
tions in the x direction. An element of the supertranslation group is given 
by 

ee (9.4.1) 


where @ is a 10-dimensional spinor. This operator will shift a function of 
the coordinate by X and a fermionic coordinate by 4. Now construct (wa = 
le) 


Ty =h'dgh = (0,X" — i0y",0)P,, + 8,60. (9.4.2) 


Now the right-moving GS action can be written as 
Sp = [ete Tr(IT, Mp ete? + / ea eI Me Mi, ) 
+ i awe (eu). (9.4.3) 


The first term on the right is the usual quadratic term of the GS action. The 
second term is the nonlinear term of the GS action written as a Wess—Zumino 
term. It 1s a three-dimensional integral over a surface whose boundary coin- 
cides with the world sheet of the string. The sum of these two terms is an 
alternative formulation of the GS action. The third term of the action just 
enforces the right-moving constraint. 


9.4 Covariant and Fermionic Formulations 387 


Similarly, the left-moving action that contains the isotopic sector can also 
be written covariantly, but we have a choice of using either fermions or bosons. 
We know from the theory of Lie groups that the generators of an algebra can 
be written in terms of the product of either bosonic or fermionic fields. Thus, 
we have 


Sa / a’ zew' dy’ e%, : (9.4.4) 


where this time J = | to 32, which labels the fundamental representation 
of SO(32). It is important to note that these fermions transform as Lorentz 
scalars. The / index is an internal index. We can, of course, choose either 
Ramond (periodic) or Neveu-Schwarz (antiperiodic) boundary conditions on 
these fermion oscillators. 

For the boson representation, we can represent the same left-moving isotopic 
sector as 


Ss, = / da? zhe(etda,X') + er (e* 4X)’, (9.4.5) 


where / = 1, 16. This, of course, yields the light cone representation we used 
earlier when we broke Lorentz covariance. (It may seem strange at first to 
see the left and right sectors labeled by +, which seems to single out a non- 
Lorentz-invariant direction in two dimensions. However, the above formulation 
is still reparametrization invariant because the + directions are placed in the 
tangent space. Thus, two-dimensional reparametrization invariance remains 
intact.) 

Now let us write the entire space-time action (minus the isospin part) that 
contains both left- and right-moving sectors in a covariant fashion: 


S= [ze {5 (Cf OqX") — ir! p-e2 Oa Wy + Fi Xa? p? Ww" pg X 


(9.4.6) 
This action has “N = ” supersymmetry: 
Ol — Hepat”, 
= OXg = —2Va 8; 
5X4 = iew", (9.4.7) 


byt = [0,X" + ix ple. 


Now let us, instead of choosing the light cone gauge, choose the conformal 
gauge. Then the heterotic action reduces to 


S=5 / A2((dyX") +iv'p dv, +iv'ptay). (9.4.8) 


The point of all this was to show that a covariant version of the heterotic 
string exists and that we can use either fermion or bosonic fields to represent 


the isospin part of the fields. We are not bound to a light cone formulation in 
’ terms of boson fields. 
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It should be noted, however, that even in covariant fashion the heterotic 
string appears a bit awkward and contrived. Perhaps a future version of the 
theory will allow a more elegant formulation. 


9.5 Trees 


Trees are constructed in the heterotic string almost exactly the way they were 
constructed for the light cone superstring, except now the vertices have left 
and right components and we also must take into account the compactification 
on a 16-dimensional lattice. 

To describe the supergravity multiplet, we will introduce the spin-3 field 
U** to represent the spinor field and p“” to represent the polarization tensor 
of the graviton. In the on-shell light cone gauge, we will choose the external 
particles to be massless and transverse: 


poe 23 ptt a koe" = 0, 
Ut = kU" = yt = 11 — yy)U" = 0. (9.5.1) 


In analogy with the superstring light cone vertex developed in Chapter 3, we 
can construct the vertex functions for the heterotic string based on analogy with 
the vertices in (3.9.2), except we need an extra insertion of P“ to yield more 
Lorentz indices. In addition, the vertices are the direct product of left-moving 
and right-moving vertices. We now choose for the boson and fermion emission 
vertices for the supergravity sector: 


Vp = pwtk) | do BY P’ elk x, 
0 


supergravity: _ - (9.5.2) 
Vr = | do F°P,U" (ke, 
0 
where we use the definitions in (3.9.6): 
k* = 0, 
B= Pee RY 
ax —— 
p= elt i ,—2in(t—o) 
d(t —c) 2P + Dome ‘ 
n#0 
Ra aS) eS, (9.5.3) 


Fo = Si(pty [Sy - P — LRU Syi:7¢, 
P! = tpi te Boe. 
n#0 


(Notice that we have explicitly chosen the frame where all k* are set to zero, 
where we have a large simplification. The calculation for nonzero k* is actually 
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quite complicated because it requires making a Lorentz transformation with 
the broken Lorentz generators M—'.) 

The vertices for the gauge mesons can also be written down. One compli- 
cation, however, is that the 496 vector mesons in the adjoint representation 
of the gauge group can be broken down into 16 “neutral” gauge bosons 
transforming as members of the Cartan subalgebra, and 480 “charged” 
_— corresponding to K‘ with (K')? = 2. The neutral ones are defined 

y 


li)a x &, 0), (9.5.4) 
and the charged ones by 
ex |K 3(k ) =2) (9.5.5) 


Thus, we must have two types of vertices for the gauge bosons. The vertices 
for the neutral gauge mesons are essentially the same as before, except that we 
interchange the Lorentz index for an internal one: 


vi= oid) | do BY P! efkuX" 
neutral gauge mesons: C0) 


Ve = iP doh Pathe = 
0 


where 
ap ax! 


at =p! Calf eo 2in(t +o) 
~ d(t +0) ns da 


n#0 


Correspondingly, the vertices for the emission of the charged gauge particles, 
which are functions of an internal momentum K’, can be found 


VE = p,(k) | go nite ee (Kg), 
charged gauge mesons: 
vx =i) do F*U%(k)ethe®" e2!K'X".C(K), 

0 


(9577) 
where the normal ordered part of the vertex comes from the left-moving sector 
and we will define C later. 

One way of checking to see that we have the correct form for the vertex 
functions is to explicitly operate on them with the supersymmetry operator 
(9.3.1) to verify that boson vertices turn into fermion vertices and vice versa. 
However, the proof, which follows the proof given in Section 3.9, is rather 
involved and will not be presented. 

The propagator of the system can also be found easily by generalizing the 
closed string propagator. We will, however, find it convenient to absorb the o 
and t dependence of the vertex function and put it into the propagator. This 

can always be done because the o displacement matrix, as we saw earlier, is 
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given by U(c) and the t displacement operator is generated by the light cone 
Hamiltonian: 


16 
H =1p?+2N+2N-1)+ )('Y. (9.5.8) 


| f= 


The vertex function, before this extraction, is 
V(k,t) =e? / doU(a)VoU'(a)e™. 
0 
The vertex function that we want is 
Vo = V(t =o = 0). (9.5.9) 


The propagator, which now absorbs the o and t integrations, becomes 


lo, @) eg 
A =i, ar | aoe tU(a) 
0 oe 


=|! d2z|z|V/aP 2 2N ZN 1H E ply 
|z|<1 


i » l I\2 
7 (-#41-F De"). (9.5.10) 


The above propagator is just what we might have expected. The delta func- 
tion simply enforces the constraint (9.2.13) that makes the states independent 
of a rotation in a, and the poles are given by the inverse of the light cone 
Hamiltonian. 

The N-point function can therefore be written as 


(0, ki] V(k2)A ++ AV (kw1) [0, kw) « (9:55 
Let us now construct the amplitude for the scattering of four massless gauge 
bosons with momenta k;, polarization o,, and charge K;, where 


A 


> ki Bi: 


wl 
4 
DEK = 0, (9.5.12) 
i—1 


amily 

The calculation of the four-point function is long but straightforward: 
Aq = 8° K(0;, ki) 

P(-1+ 945) (-14344)r(-14+£+4 2) 

is - ec z 


Clase) b (he Geis) 
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where 
K = —4[stp1 - 0302 + 04 + Supy + 0301 + ps + tupy + P23 + pa] 

— 45[01 + k403 + k2p2 + pa + pr + k304-k1p1 + 13 
+ Pi -k304 + k2p2 + 3 + pr + kgp3-k1 01 - pa] 

— $t[ 02 + ki 4+ ky ps + p1 + 03 + kar - kx 02+ ps 
+ £2 + k4p + k303 + pa + p3 > ki p4 + kr pr + ps] 

— 5ul pr - ko pq + k303 + 02 + 03 kapr-kipr + pa 
+ 21 + kar + k303 + P4 + 03+ k2p4- ki pr + pr], (9.5.14) 


where ¢ is a phase factor and 


s=—-(ki +h, 
t= —(ky +k’, 
u=—(ki +k, 
s+tt+u=0 , 
S=(K,+ K2), (9.5.15) 
T = (K, + K3), 
U =(K;+ K3/, 
Ree) Se UE ats) « 
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The real test of the theory, however, is the calculation of the single-loop diagram 
[6, 7], where we demand that the theory be finite. It is now a straightfor- 
ward process to write the single-loop diagram in terms of these vertices and 


propagators: 
Aico CN Aes AV (1), (9.6.1) 
where we consider scattering by charged gauge mesons. 
Again, the trace calculation is long but straightforward. After the trace is 
performed, we have 
4 5 
—4r 
_¢ 221.,,\-2 ~—1 ¢¢\—24 
NR | ie z;|w| E =| w f(w) 


= 


Gr ol ew)" "1, @.6.2) 


I<i<j<4 
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where 
In? |z| |J1—z UA — wz) — w/z) 
= =< —____—_____———|, (9.6.3) 
PED ate iat oa Zz I] (1 — wp 
a f=7 2/0 -—w oe nh) 
= —_—_—_—____.——— ],, (9.6.4 
ee 2 
ee ] = NZ; ; 
OS CT ay tes) Y > exp 5 In ’ 5 no.) ; (9.6.5) 
PeA fel 
and where 
ff 
QO; — > Ki, 
j= 
: In Z; 
i= 9.6.6 
4 d, wal ( ) 
= In w 
Dari 
Cyt = 2iZi41 °° *Zjs 


where the sum over P € A represents the sum over all points in the lattice and 
K isa kinematic factor identical to the one found for the trees in (9.5.14). Be- 
cause the final result bears so much resemblance to the single-loop superstring 
amplitude, it is not hard to show that the amplitude is invariant under 


gargs 


O69) 
ve => Db sr ue, ( ) 


Slightly more difficult is the proof that the integrand is invariant under tT > 
t + 1 and t + —t™!, which is necessary to prove modular invariance. 
Fortunately, most of the terms in the integrand are identical to those found 
in a single-loop amplitude in (5.5.1). However, we must check the invariance 
of the terms that are different, i.e., the factors dependent on the lattice. 
As before, the theta function, if we make a modular transformation on its 
arguments, transforms as follows: 


v 
©) | ae 
ct+d 
where e* = 1. Since the x function can be written in terms of theta functions, 
we have 


ene 


) &(ct +d)" exp gl 


at+b 
ct+d 


) @,(d|z), (9.6.8) 


a7 )- 1 ae 
ct+d [cor ea oe 


( v 
a ct+d 
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The partition function f(w) also transforms as 


f(w) = (i/t)'? ww" Fw’), 


— a td 9.6.9 
2ni tT Inw’’ Sacto, 
The most important transformation is the one for L: 
—22\° 2n? 4. Q;Inz; \" 
LQ, 2), Ki) = : —— {| P— — 
( (=) Eee In w (> y 20 
1 a ; 
a In Z; : 6. 

a (> 0; na) (9.6.10) 


This last identity depended crucially on the lattice being self-dual, which is 
perhaps one of the strongest arguments for this restriction on the lattice. Let 
us put all these factors together: 


Tie. 0) a I] w(djilt)"™ L(t, v;) 


sss ype 


=e = /\—1 —f\~24=12 Vij 
(wy Fwy "t TI v( : 


Isi<j<4 


2 
—| ae Eng i ms = 
«L(S=)@ exp 2 (yon) (9.6.11) 


There are two factors in (9.6.11) that apparently violate modular invariance, 
the various factors of tT and the exponentials involving T. Fortunately, both sets 
of terms vanish. The factors of T cancel because 


kik, =e 


=a 


Wests 


This leaves us with 7!*-4-*® = 1. The exponential factors also vanish because 


4 Z 
D 07, Ki i (> on . 
1<i<j<4 i=l 
Once we eliminate the factors of tT and the exponentials of tT in (9.6.11), we 
will find that this combination is modular invariant. 

The invariance of the amplitude under v; — v; + | and vy, > v; + T means 
that we can truncate its region of integration: 


0 < Imp; < Imrt, 
—} < Rev; < 7: (9.6.12) 
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Also, as in the case of the closed single-loop diagram because of invariance 
under t > t +1 and t — —1/t, weare free to choose a fundamental region: 
1 


ai a 
<Rets>, (9.6.13) 


2 
|r|]. 
Once again, the choice of this fundamental region allows us to avoid the 
potential singularity at t = 0, and hence we have a finite one-loop action. 

In this discussion, modular invariance was crucial in showing that the ampli- 
tude, like the usual superstring, is finite. We can simply choose a fundamental 
region where no singularities are present. However, the calculation obscures 
precisely why the theory is modular invariant. Let us analyze the simplest 
case and isolate the point at which modular invariance comes in. To simplify 
matters, let us consider the vacuum one-loop amplitude with no external legs. 

Let us first define a function F (which appears in the trace calculation in 
(970.1): 


Fr, X= ee (9.6.14) 
LEA 


where we sum over lattice sites, X is an arbitrary |6-dimensional vector in the 
root lattice space, and each site is labeled by integers n;: 


= 7 ne). (9.6.15) 
Notice that F' is periodic: 
F(t, xX) =f xe) (9.6.16) 


because this shift can be simply absorbed into the integers n,. Let us now write 
this function in terms of its Fourier transform F: 


FU k= eee | rN: (9.6.17) 


MeA* 


Notice that, because F is periodic, the M’s must necessarily lie on the dual 
lattice: 


16 
M! = > mer". (9.6.18) 
il 


Now take the reverse Fourier transform to find F in terms of F: 
e Le eee 
F(t, M) = | as EGY): (9.6.19) 
1k | 


where ,/|g| is the volume of the torus. Now insert the expression for F into the 
previous equation. Perform the integration, and we arrive at an explicit form 
for F: 


is | : 
F(t, M) = vie o (9.6.20) 
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Now we would like to insert this identity into the expression that actually 
occurs in the one-loop vacuum calculation. We must calculate the trace over 
the Hamiltonian, which contains the piece (p ')? Thus, in the trace calculation 
the function f appears: 


ON eS) at a (9.6.21) 
LEA 
From (9.6.14), we have 
F(t, 0) = f(t)t*. (9.6.22) 


Let us now substitute the expression for F', and we have an expression for ie 


= imM-M/t 
1= Fa (-i) Ee 


yeas (-:) . (9.6.23) 


where f* is nothing but the usual f defined in (9.6.21) over the dual lattice. 
This is the key result. 

In summary, we see that the modular transformation t — —1/t has re- 
placed the lattice with the dual lattice. Thus, for modular invariance we demand 
that the lattice be self-dual. In fact, this is the origin of the condition that the 
lattice be self-dual. Here, we see the tight link between self-duality (which 
eventually restricts us to either Eg ® Eg or Spin(32)/Z2) and modular in- 
variance. Although the original choice of these groups came from the anomaly 
cancellation, we see that we need precisely these groups or modular invariance 
and finiteness of the amplitude. 


9.7 Eg and Kac—Moody Algebras 


Earlier, we saw that the spectrum of states was not manifestly Eg® Es invariant. 
Only with difficulty were we able to show that the lowest-lying states could 
be placed in irreducible representations of the group. Because the number 
of states rapidly proliferates into tens of millions, it becomes prohibitive to 
actually rearrange the states into Ez ® Ex multiplets. 

In this section we will use the techniques of Kac—Moody algebras [8-10] 
developed in Chapter 4 to show, to all orders, that the heterotic string does 
indeed have a spectrum that is Eg ® Ex symmetric. We will use the vertex 
operators defined in that chapter to generate the representation of a Kac 
Moody algebra (which works only if the lattice is even, self-dual, and the 
algebra is level 1). 

The representation that we desire is the Chevalley basis, where the 496 
generators are broken up into 16 mutually commuting generators (which form 
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the Cartan subgroup) and the 480 root vectors. Notice that the 16 p’ vectors 
obey the condition 


[p’, p’] =0. (9.7.1) 


Trivially, we can always represent the Cartan subalgebra as simply the mutually 
commuting vectors p’. More difficult, however, is the construction of the 480 
root vectors. 

The simplest operator with 480 states is the vertex operator, which we can 
write in terms of a line integral surrounding the origin: 


Hk) = f Mee zoe CK: (9.7.2) 
Miya 


where the vertex function V is defined over the lattice (rather than over space— 
time): 


V(K, z) = eR’ XL), (9.7.3) 
and where 
2) 
z= et{tte) (9.7.4) 


where the cocycle is as yet unspecified. Notice that the vectors K’, by 
definition, point in 480 directions in the 16-dimensional root lattice space. 

We now demand that the 16 elements of the Cartan subalgebra p’ and the 
480 elements of root lattice space obey the commutation relations of Es ® Eg. 
This will, in turn, fix the C operator. We demand 


a(K, LNE(K + L) CR Ly = 2, 


LEK), ECD eee ik + D0, (9.7.5) 
0 otherwise. 
and 
(pi, EO KECK, (9.7.6) 


where €(K, L) are the structure constants of the algebra with values +1. 

In some sense, we have done nothing yet. We have simply written down the 
commutations of the algebra of Ey ® Eg in the Chevalley basis, which are well 
known, and then demanded that our ansatz satisfy them. The nontrivial part is 
that a solution of these equations actually exists which fixes the value of C. 

For the moment, let us drop the factor C and see if we can satisfy the 
commutation relations. It is easy to show 


V(K, z)V GW) = Gwe) Ee wy) eh Ee) oa 


which is valid for |w| < |z|. 
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From this, we can show 


; dw d 
eB, (CF EE = § $5 ae 
x (z- wk E(yyz) UAE L pK-X(Z)+2L-X(w) 


(9.7.8) 


where the w integration is performed such that |z| > |w| in the first term and 
|z| < |w| in the second. By carefully examining the last expression, we find 
that we have satisfied all the identities of the Chevalley basis except that the 
statistics are all wrong. Anticommutators arise instead of commutators. That 
is why we had to introduce the factor C(K ). The condition we must impose is 


C(K)C(L) = &(K, L)C(K + L). (9.7.9) 


The C(K) operator is called a “cocycle” or “twist” and was added into the 
definition of the generator E(K ) in order to get the right statistics. Acting on 
states of momenta p’, we can show that C(K) has the property 


C(K)|p) = &(K, p)|p). (9.7.10) 


If we demand that this law be associative, then we have a restriction on 
these phases. By acting on states successively by the C’s and demanding 
associativity, we easily arrive at 


e(K, L)e(K + L, M) = &(L, M)e(K, L + M). (9.7.11) 


We call these the two cocycle conditions. Several explicit representations of 
C(K) and the phase exist. We can always choose 


e(K, L)e(L, K) = (-1)"", 
é(K, 0) = —e(K, —K) = 1. (S712) 
In the Appendix, we show that the generators of the Lie algebras can be 
arranged in terms of the Cartan generators H; and the eigenvectors E,. We now 


see the exact correspondence between the operators appearing in the heterotic 
string and the generators of a Lie group: 


Heterotic Cartan—Weyl 


p’ Hi, (9.7.13) 
E(K) jae 
Ke a 


The last step in the proof is to notice that the 16 p’ and the 480 E(K) 
generate the vertices of the theory (e.g., see (9.5.6), (9.5.7)), so therefore the 
Fock space consists of these operators acting on the vacuum. But since these 
generators together make up the 496 generators of Ex ®@ Es, the entire spectrum 
itself must be arranged according to irreducible representations of Ex ® Ex. 
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9.8 Lorentzian Lattices 


So far, we have been discussing heterotic strings in which one sector is 
compactified from 26 dimensions down to 10. Then, according to con- 
ventional wisdom, we must compactify the 10-dimensional space down to 
D-dimensional space-time. Yet another intriguing possibility is to directly 
compactify the 26- and 10-dimensional spaces down to D dimensions from 
the very beginning, bypassing the intermediate stage. This is the approach of 
Narain [11,12], which yields gauge groups of rank 26 — D, which is larger 
than the Es ® Eg considered so far. 

Let us begin with the two sectors that are 26- and 10-dimensional and now 
compactify them both down to D space-time dimensions. Then the left-moving 
sector has 26— D = p dimensions, and the right-moving sector has 1|0— D = q 
dimensions. Now parametrize the compactified dimensions as 


eae =q' as IL AG: Eo) + eee sate tite), 
“m0 2 


Me Gee ae ar ave ay, (9.8.1) 
«zo 2 


where A ranges from | to p and B ranges from | to qg. Notice that we have 
the relation p = q + 16, which guarantees that the number of uncompactified 
spacetime dimensions for both sectors is D. 

Now impose the constraint that Lo — Lo annihilates states, so there is no 
preferred o origin for the closed string. This yields the conditions 


(k? —k?) = N—N +1 = integer. (9.8.2) 
The mass relationship is now given by 
tm? = 3k +3kh+N4+N—-1, (9.8.3) 


Notice that these equations all reduce to the usual heterotic string if we take 
P= ibe — 0. 

Now let us check that the final result is modular invariant. This will place a 
new restriction on the lattice, which at this time is still arbitrary. The single- 
loop calculation (9.6.2) done for the general case is almost identical to the one 
for the usual heterotic string. A careful analysis of the terms shows that the 
generalized case contains the new factor 


ie z(1 v4 Fal Pe teal FE 
Fe LE Kine? (9.8.4) 


Now calculate the change in this factor under the transformation t — t + 1. 
It follows that we pick up a new phase factor 


eim(L?— Lb?) (9.8.5) 
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In order to cancel this term, we are forced to have 
(L4) — (L?yY = even integer. (9.8.6) 


Because of this extra minus sign, the metric on the lattice is “Lorentzian” rather 
than Euclidean. (For the usual heterotic string, L is zero, so we never see the 
Lorentzian behavior of the lattice.) 

Finally, when we apply the modular transformation t —> —t~!, we find that 
the lattice must be self-dual. In summary, modular invariance at the one-loop 
level is preserved if we have an even, self-dual, Lorentzian lattice. 

Mathematically, it can be shown that such Lorentzian lattices do indeed 
exist, when we satisfy the condition p — g = 8n, for integer n. For our case, 
iN =e 

We can also calculate the number of parameters in this lattice. It can be 
shown that the Lorentzian lattice is unique up to an SO(p, q) transformation. 
But the mass formulas we have derived are invariant only under SO(p) and 
SO(q). Thus, the total number of parameters is 


dim SO(p, q) — dim SO(p) — dim SO(q) = pq. (9.8.7) 


Thus, the total number of parameters that characterize the lattice is p(p — 16). 

We have considerably expanded the number of groups that are available 
to us by this method of compactification. The total rank of the group is now 
26 — D. This means that we can have groups as large as SO(52 — 2D). In 
four space-time dimensions, this means SO(44). For D = 4, we can also have 
Ex ® Es ® E; ® SU(2), or Eg ® E? ® G, where G can be either SO(4) or 
SU(3). We can also have Eg ® Eg @ Gio_y, where G must be simply laced. 

At this point, we may conclude that Lorentzian lattices are an entirely new 
way to compactify the heterotic string. Actually, this is not quite right. It turns 
out that we can still represent the Lorentzian lattice within the conventional 
string picture of the heterotic string. Consider, for example, the following string 
action in the presence of background fields: 


81 9X! O° X! + 6% Bij Dy X'OgX! + °F Al dyX'dgX", (9.8.8) 


where / goes from | to 16, B;; is an antisymmetric tensor, and 6°? is an 
antisymmetric tensor in 2-space. Now assume the g;;, B;;, and A} fields are 
approximated by constant background fields. Notice that the total number 
of parameters in this approach is easily found by counting the number of 
independent modes within these fields: 


8ij 5q(q +1) 
Ai p> { 16q (9.8.9) 
Bij 79(q — 1) 

for a total number of pg parameters. Now quantize this system, assuming that 


X can be approximated by 
X' =2on' + q'(t). (9.8.10) 
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Now substitute back this expression into the action, quantizing the system in 
the presence of these constant background fields. The net effect of these back- 
ground fields is to add pg new parameters into the compactification process, 
which is precisely the number of new parameters introduced by the com- 
pactification on Lorentzian lattices. It can be shown, furthermore, that the 
conventional compactification in the presence of these background fields is, in 
fact, equivalent to compactification on Lorentzian lattices. 

In summary, the Lorentzian lattice not only is equivalent to the conven- 
tional compactification scheme when we include the presence of background 
fields but also provides a very convenient framework in which to catalogue an 
enormously large class of compactifications. 


9.9 Summary 


The anomaly cancellation requires that we have either O(32) or Es @ Eg. 
However, the Chan—Paton factors are incompatible with exceptional groups. 
Thus, we have no choice but to consider compactification as the process by 
which we can generate exceptional groups for the superstring. 

The simplest possible compactification process in one dimension requires 
that we make the identification 


+= x +o, (9.9.1) 


A one-dimensional scalar function must then be expanded in terms of periodic 
eigenfunctions: 


Gy one (9.9.2) 


where 
Nn 
——, 9.9.3 
p= Gsm) 


We see that momentum is quantized by taking periodic boundary conditions. 

The heterotic string, by contrast, compactifies 26 dimensions down to 10 
dimensions, leaving a 16-dimensional lattice. It is a closed string that sep- 
arates the right- and left-moving sectors. The right-moving sector is purely 
10-dimensional and contains the fermionic superstring and the bosonic string. 
The left-moving sector was originally 26-dimensional but was compactified 
down to 10 dimensions, leaving a 16-dimensional theory defined on a lattice 
and a 10-dimensional bosonic string (which combines with the right-moving 
bosonic string to complete a bosonic closed string). The final action, in the 
light cone gauge, is 


f= 


16 
i ri I zl -c —_ ‘ 
be zz arf do (ax hy, @ = - ) 0,X°0,X° +iSy~ (A, + 0,)S 


(9.9.4) 
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where we enforce the constraints 
(8, — 0,)X’ = 0, 
ays = 3(1 + yi) — 0. (9.9.5) 

This action is explicitly supersymmetric under the variation 
ax = phys 096 
Ss =i(p') “y-Yu(O, — ,)X"e. 


When we analyze the spectrum of the heterotic string, we must take into 
account the quantization of the momentum and also the winding of the 
string around the compactified dimensions. This yields the conditions on the 
spectrum: 


- 1 <6 
1m? = N+(N —1)+ . De (9.9.7) 

iI 
The final constraint on the spectrum arises because we want to make the closed 


string invariant under an arbitrary o rotation. The operator that generates this 
rotation is 


16 
U(0) = exp | 2i0 (x -N+i1-— 5 S0'') (9.9.8) 
T=) 


Thus, we demand 
e 1 <8 
N=N-1+5 De (9.9.9) 
=I 
Vertex functions can also be constructed for the heterotic string. They are, 
in fact, simply left- and right-moving products of the usual superstring ver- 


tices that were constructed earlier in Chapter 3. We choose the ansatz for the 
supergravity multiplet: 


. Vie putt) f do BYP’ elk * 
: 0 


Va= / do F*P.,,U%(k)e** , (9.9.10) 
0 
where 
k+ =0, 
ee eek Rh. 
i dX' lp —2in(t—c) 

ES] = 1 Giga et (9.9.11) 

d(t — a) os a: 


1 = 
RU = \Sy'i-5, 
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Fe li(p) VISy- 2 —_ RoR Sy Ae 
Pi st tpi ne Nila ecu. 
n#0 
With this vertex function and the usual propagator (plus constraint), we can 


calculate the four-point function for the scattering of four massless gauge 
bosons: 
; r(-1+44+%)r(-1+é+ 
Ag = 8 K(0;.ki)-€ ( Ean p+) 
(9.9.12) 
where K is a complicated function of the polarizations and S, 7, and U are 
defined as 


S=(K,+ K2y, 
T =(K.+ Ks’, 
U =(K, + K3y’, (9.9.13) 
S+T+U=8. 


Similarly, we can define the one-loop amplitude, which can be explicitly 
shown to be modular invariant. Thus, we have the freedom to eliminate the 
singularity at t = 0, which means that the theory is one-loop finite. 

When analyzing the spectrum of the theory, we found that it was pro- 
hibitively difficult to construct the spectrum in terms of Ex, @ Ex multiplets. 
The proof that the spectrum can be represented in terms of irreducible 
representations of that group can most easily be constructed using the Kac— 
Moody algebras. The Kac—Moody generators have the following commutation 
relation: 


[Ts Te) = iF Tin + km5 8m. —n- (9.9.14) 


Notice that this appears to be an ordinary Lie algebra that has been smeared 
over a circle. We can also rewrite this algebra in terms of the Cartan—Weyl 
representation, in which the generators look like 


LH;(@), H,(@) = 0, 
[H;(@), E,(6’)] = —278(@ — 6')a; E. (6), (9.9.15) 
and 


[Eq(0). E-al’)] = 275(H — 0') Y > a HH) + 27184 = 4"). (9.9.16) 


and 
278(0 —O )Egy3(0) ifa+ Bel, 


[E.(6), Eg(6’)] = | 0 otherwise 


(9.9.17) 


Using vertex operators, we can explicitly construct a representation of the 
Kac—Moody algebra. Thus, since the vertex operators generate the spectrum 


References 403 


of the theory, the spectrum itself must be invariant under Kac—Moody trans- 
formations. Therefore, even though the spectrum is not manifestly Ex ® Ex 
symmetric, we have established that it actually is symmetric under that group. 

New compactification schemes more general than the one presented are 
possible by using a different compactification scheme down to D space-time 
dimensions. By directly compactifying the left-moving sector from 26 down 
to D spacetime dimensions and compactifying the right-moving sector from 
10 down to D dimensions, we can bypass the intermediate step of the Es @ 
Ex heterotic string. The new feature of this compactification scheme is that 
modular invariance restricts us to Lorentzian lattices, i.e., lattices where the 
metric has alternating signs. It can be shown that such Lorentzian lattices are 
perfectly well defined. This thus gives us an entirely new class of heterotic-type 
strings with gauge groups as high as SO(52 — 2D). Although these models 
look much different from the standard heterotic string, it can be shown that, 
with constant background fields g;;, Aj , and B;;, we can generate these new 
classes of models. 
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CHAPTER 10 


Calabi-Yau Spaces 
and Orbifolds 


10.1 Calabi~Yau Spaces 


Although the heterotic string is a great advance over the usual formulation of 
string theory, the question still remains whether we can break the theory down 
to four dimensions and do rigorous phenomenology. The answer, unfortunately. 
is no. 

At present, our arsenal of techniques is still too primitive to answer the 
question of whether the theory undergoes spontaneous dimensional breaking. 
We will have to wait until further developments are made in string field theory 
or M-theory before any conclusion can be reached concerning the true vacuum 
of the theory. 

Surprisingly, it turns out that rather mild restrictions on compactification are 
sufficient to obtain reasonable phenomenology. Although none of the solutions 
so far agrees totally with the minimal model SU(3) ® SU(2) @ U(1), we come 
remarkably close with just a few assumptions on the classical vacua. 

Unfortunately, however, we find an embarrassment of riches. There appear 
to be thousands or even millions of possible classical solutions, and it remains 
to be seen how to choose one vacuum over another. Thus, although naive 
phenomenology can be performed on the string, we still have to wait for the 
development of a nonperturbative formalism before any definitive statement 
can be made concerning the true vacuum of the theory. 

In this chapter, we will thus assume that the compactification can be per- 
formed and will discuss two methods of writing the classical vacua for the 
string theory: 


(1) Calabi—Yau spaces. We require that N = | supersymmetry in four dimen- 
sions be unbroken. This simple assumption forces us to consider manifolds 
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with a covariantly constant spinor. This, in turn, forces the six-dimensional 
manifold to be a Calabi—Yau manifold [1, 2]. 

(2) Orbifolds. We compactify on tori divided by the action of a discrete group. 
This allows us to break the gauge group and arrive at different low-energy 
predictions. (Orbifolds are probably special limits of Calabi-Yau spaces, 
although this is not totally clear.) 


Let us begin, however, by taking the zero slope limit of the theory, which 
reduces to D = 10 supergravity coupled to Eg @ Ex super-Yang—Mills theory 
(see Appendix), and make some reasonable assumptions about the breaking 
scheme. Candelas, Horowitz, Strominger, and Witten [1, 2] made the following 
assumptions about the zero slope limit: 


(1) That the 10-dimensional universe has compactified down to a product of 
a four- and a six-dimensional universe: 


Mj => M, x Ke, (10.1.1) 


where the M, universe is a maximally symmetric space, i.e., 


R 
Ruvagp = [p (Sua Bu “F BupSrva) (10.1.2) 


and K is compact. (The assumption that the four-dimensional manifold is 
maximally symmetric restricts the manifold to be either de Sitter, anti-de 
Sitter, or Minkowski.) 

(2) That an N = | local supersymmetry remains unbroken and has survived 
the compactification. 

(3) That some of the bosonic fields can be set to zero: 


0, (10.1.3) 


The second assumption about NV = | supersymmetry, in particular, is espe- 
cially important because it places nontrivial constraints on the structure of the 
manifold K°. If supersymmetry is unbroken, then the supersymmetry genera- 
tor Q must vanish on the vacuum |0) (see (8.8.4)). The variation of a fermion 
field, under supersymmetry, is given by 


sy = (20, WI). (10.1.4) 


The vacuum expectation value of this equation vanishes if supersymmetry is 
preserved (since Q annihilates the vacuum): 


(0| dy |0) = 0. C1Ou105) 


But in the classical limit, the variation of a fermionic field and its vacuum 
expectation value are the same: 


du ~ (0| Sy |0) . (10.1.6) 
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Thus, if N = | supersymmetry survives compactification, then the variation 
of the fermion fields must be zero: 


by =0. (10.1.7) 


This, in turn, places nontrivial restrictions on the supersymmetry parameter €. 
Originally ¢ was an arbitrary spinor field. However, demanding that VN = | 
supersymmetry be exact, means that we will select a subset of the infinitely 
many possible ¢ such that N = | supersymmetry is preserved. 

Now, the variation of the D = 10 fermions is equal to [3] 


Si Dee rs ale — 96/7" )e Hig + - 
= aaa Fie (10.1.8) 
64 = ——(I - 0¢)e + ——— ame ey +e, 


where the Roman letters i, j, k, a, b, c are six-dimensional indices and we 
drop higher four-fermion interaction terms. In addition, we have the Bianchi 
identity, which is satisfied exactly: 


dH =TrRAR-—ZTrF AF. (10.1.9) 


Now let us impose the second and third conditions, which will place 
restrictions on our parameter ¢. Our variations reduce to 


I 
OW; = — D;é == (0), 

kK 
a be), (10.1.10) 


These are highly nontrivial constraints on the system, especially the first 
statement that forces ¢ to be a covariantly constant spinor, 1.c., De = 0. 
In particular, it places enormous constraints on the spin connection for the 
covariant derivative and hence the space itself. 

The first equation in (10.1.10), for example, states that the parallel transport 
of the spinor € through a distance leaves the spinor invariant. Furthermore, 
we can perform two such displacements and travel around in a closed path. 
For example, if we differentiate (10.1.10) again, we obtain the variation of the 
spinor around a closed path. We find 


Dé =0-> [D;, D,\e =o 
—> Riles = 0. (O19) 


This implies that the spinor remains unchanged when we make a displacement 
around a closed path. This, in turn, implies that the manifold K is Ricci flat: 


R,; =0. (india) 
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This last condition, in particular, is important because it says that the metric 
tensor describes a flat four-dimensional Minkowski space. Thus, de Sitter and 
anti-de Sitter spaces are ruled out. 

Now let us take an arbitrary spinor and make a parallel transport around a 
closed curve. A spinor after making a closed circuit is given by 


ec’ > e+ A™D,,, Dale, (10.1.13) 


where the area of the closed path is proportional to A’”. Thus, we see that 
a spinor simply rotated from its original orientation, with the rotation matrix 
being the commutator of two displacements, which is the curvature tensor: 


é€— Ue, (10.1.14) 
where 
USA Dae Dale, Cet) 


Now let us make several consecutive closed paths, each time leaving and re- 
turning to the same point. In general, each time we make an arbitrary number 
of closed paths around the same point, we wind up with the original spinor 
times a small rotation. Thus, the set of all such rotations forms a group: 


&€ > U,U\é = U3e. (10.1.16) 


This group is called the holonomy group. 

Now let us apply this result to our special case. For the manifold K°, an 
arbitrary spinor has a spin connection that is an O(6) gauge field. A spinor 
transforming under O(6) has 23 = 8 elements. However, we know that 

O(6) = SU(4). Oals17,) 
Thus, these eight elements in a spinor of O(6) can also be rearranged according 
to SU(4): 
8=404. (10.1.18) 
Under SU(4), these two objects transform as spinors of opposite chirality. We 
can let the spinor ¢ have positive chirality. This eliminates half the components, 
so that it transforms as the 4 of SU(4). 


The spinor that we are investigating, however, is not an arbitrary spinor but 
satisfies the condition 


De=0 (CLOMT9} 
which means that 

c= Ue, (10.1.20) 
that is, the spinor remains the same after being transported around a closed 


path. 
The question now is: What is the subgroup of O(6) that leaves invariant the 
4 of SU(4)? The answer is a familiar one, taken directly from the theory of 
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Higgs breaking. There, we also want to know the answer to the question: What 
is the largest group that will leave a constant spinor or vector invariant? 

The answer this question, note that we can always, by an SU(4) 
transformation, put ¢ in the form 


0 


&>ée= : ; (10.120) 


€0 


So far, we have done nothing. We have simply taken an arbitrary spinor and 
put it into this form by an SU(4) rotation. But now it is obvious that the largest 
group U that will keep such a spinor invariant is the subgroup of 3 x 3 complex 
matrices within SU(4), that is, SU(3). Notice that we now restrict the U matrix 


to be block diagonal: 
SU 0 
U= ( he ; i, (10.1.22) 


We can now trivially satisfy ¢« = Ue. It is important to note that our result 
is quite general. We chose a particular representation of € only to show this 
general result, which works for any representation of ¢. 

In conclusion, we say that the existence of a covariantly constant spinor 
forces the holonomy group to reduce from O(6) to SU(3). We say that K° has 


SU(3) holonomy. 
The logic that we followed can be summarized as 
N = 1 SUSY —> D;e = 0 > SU(3) Holonomy. (10.1-23) 


This is an important result, but it is rather useless. Very little is known about 
manifolds with SU(3) holonomy. In fact, none are known explicitly. Therefore, 
this important piece of information cannot be used for phenomenology. How- 
ever, there is still hope, because we have not yet exhausted the information 
that can be derived from our principles. 

We have not by any means exhausted all the implications of having a co- 
variantly constant spinor in the theory. We can, for example, always construct 
the following object out of the spinor field. which transforms as a true tensor 
on the space: 


Jj = —ig"éTyje. (10.1.24) 
Using a few identities on spinors, we can show that 


In JP = —8?. (10.1.25) 


Tibet 
Whenever we can construct such a tensor J that maps the space into itself 


and satisfies J/° = —1, we say that the manifold is almost complex. (In two 
dimensions this tensor statement simply says that there exists a number i such 
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that i? = —1, which is trivial.) This tensor is a very interesting object if the 
spinor € is covariantly constant. For example, by differentiation we find 
|B ae Be SU (10.1.26) 


This means that the metric, in addition to being Ricci flat, is Kahler. (These 
terms will be defined shortly.) Lastly, we can also define a one-form: 


Pea Pinas, (1001-27) 
where IZ, is the Christoffel symbol. We find that this form satisfies 
di=0 (10.1.28) 


which states that the first Chern class vanishes: c; = 0. 
In summary, retracing the logic of our assumptions, we have 


N = 1 SUSY > D;e =0 — K is Ricci flat, 
Kahler, with vanishing first Chern class. 
(10.1.29) 


The great advantage of this new result is that many Ricci-flat, Kahler man- 
ifolds with vanishing first Chern class are known. Thus, instead of relying on 
SU(3) holonomy, which is a dead end, it is much preferable to rely on these 
types of Kahler manifolds. 

One question that remains unresolved is the relationship between these two 
kinds of manifolds. Fortunately, Calabi conjectured (and Yau later proved) the 
statement that [4-6]: 


Theorem (Calabi-Yau). A Kahler manifold of vanishing first Chern class 
always admits a Kahler metric of SU(3) holonomy. 


Thus, using the theorem of Calabi-Yau, we can generate (at least in principle) 
thousands of six-dimensional manifolds for phenomenological purposes. 

In order to actually construct such Calabi-Yau manifolds, it is important to 
first review a few elementary facts about algebraic geometry and cohomology. 
Let us now make a digression and discuss some simple properties of Kahler 
and Ricci-flat metrics in the language of cohomology. We will find that many 
of these results from cohomology theory can be incorporated directly into 
superstring phenomenology. 


10.2 Review of de Rahm Cohomology 


As stated in the Appendix, the theory of differential forms begins with an 
operator that is nilpotent: 

dare On, 

d* =0. (10.2.1) 


410 10. Calabi-Yau Spaces and Orbifolds 


We say that an N-form a is closed if 
closed: dw =0. ClO) 
Similarly, we say it is exact if there exists an N — | form @ such that 
exact m= da. (10.2.3) 
Notice that the set of forms is a subset of the closed forms: 
exact forms C closed forms. (10.2.4) 


In three dimensions, these statements can be summarized by the well-known 
statement that the gradient of a scalar always has zero curl: 


A=V-¢ > VxA=0. (10:2:5) 


In fact, the theory of forms allows us to simply reexpress many of the theorems 
in ordinary tensor calculus in three dimensions. 

In Maxwell theory, we say that two field tensors A, and B,, describe the 
same physics if they differ by a total derivative: 


Ap = B,=0,h- —>- Agee. (10.2.6) 


This, of course, is the essence of gauge theory. In this mathematical language, 
we will say that two forms w and a’ belong to the same equivalence class if 
they differ by a closed form. 


/ , 
w-w=da > oro. 


In Maxwell theory, we want to construct the set of all inequivalent fields. In 
the theory of forms, we do this by defining the set H’(M) to be the set of all 
closed p forms modulo exact forms, i.e., 

FE ig peroneal (10.2.7) 
exact p forms 
This defines the pth de Rham cohomology group H'(M) defined on a 
manifold M. 

Notice that the cohomology group counts how many independent p closed 
forms can be defined on a particular manitold. Cohomology is thus dependent 
on the /oca/ structure of the manifold. However, as the name suggests, there 
is a duality between cohomology and homology, which is based on the global 
properties of a manifold: 


cohomology — local properties of a manifold, 
homology — global properties of a manifold. 
To make the relationship between homology and cohomology more precise, 


let us define how to establish this duality. We begin by writing the integral over 
a region C defined on a manifold M. where C can be a line, surface, volume, 
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i 


We can think of this as a map that takes a p form w and marries it with a surface 
C to produce a real number. Thus, we can view this operation as the “scalar 
product” of a form with a surface: 


etc., 


(C|w) =fe. (10.2.8) 
c 


Now let us write the familiar Stokes’ theorem, generalized to N-dimensional 
space and expressed in the language of forms: 


[a= | o (10.2.9) 


where C is an N-dimensional manifold and dC is defined as the “boundary” 
of C. Now let us rewrite Stokes’ theorem so that it is reexpressed as a scalar 
product equation: 


Stokes’ theorem: (C|dw) = (Clo). (10.2.10) 


Thus, we have established that the dual of the cohomology operator d is equal 
to 0, the boundary operator: 


d<0. | (10.2.11) 


Because of this duality, we suspect that the boundary operator 0 also defines 
the dual to the cohomology group, which we call homology. 

To be precise, let us clarify what we mean by the boundary operator 0 and by 
the surface C in terms of simplexes. Let us define the line segment that extends 
from a point p; to pz as a 1-simplex. The |-simplex has a definite orientation 
or direction: 


lineseepmient= [p,, po] = —[p2. pil. CLOrZA2 } 


Let us define a triangle as a 2-simplex, which is labeled by three points or 
vertices: 


triangle = [p1, p2, p3] = —[P1, Ps. Pz]. (10.2.13) 


Notice that the 2-simplex is cyclic in the three points, but flips sign if the 
points are arranged anticyclically. Obviously, we can generalize this concept 
by introducing an N-simplex: 


WN -simplex = [(pji,.+..Py+i |- (10.2.14) 


(Notice that the vectors formed by taking differences of the points p; must be 
, linearly independent, or else the simplex collapses to a lower dimension.) 
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Now let us define the boundary operator that maps m-simplexes into (m — 
1)-simplexes. For example, 


d\[P1, P2] = [pi] — [pal, (10.2.15) 


where [p] is a point. Thus, we define the “boundary” of a line segment to be 
the two endpoints. Acting on a triangle, the boundary operator creates line 
segments: 


d[P1, P2, P3] = [Pi. P2] fe [p2, pP3] af [P3, pil. (10.2.16) 


Thus, the boundary operator acting on a 2-simplex (a triangle) simply breaks 
it up into its three edges, which are composed of lines or |-simplexes. 

It is easy to check that the boundary operator is nilpotent. For example, for 
the simple case of triangles we have 


0102[P1, P2, P3) = O{[pi. P2] +[p2, p3] + [p3. pil} 


= [pi] — [p2] + [p2] — Lp3] + [ps3] — [pi] 
=0. (10.2.17) 


Thus, we have the important result that the boundary operator is nilpotent: 
a =40, 


This can be generalized to the arbitrary case. Let us now define how the 
boundary operator acts on an N-simplex: 


N+1 
Ov[Pi.---» Pri] = > (-1)'[pi.-.-. ee pil (10.2.18) 
f=1 


where we omit the points that are hatted. Thus, the boundary operator maps 
N-simplexes into (N — 1)-simplexes. It is not difficult to show that 


On-10n = 0 (10.2.19) 


for the general case. 
In analogy with the theory of homology, let us define the set of cvc/es Z to 
be the set of simplexes that obey 


IZ 0. (10.2.20) 


For example, take the two-dimensional doughnut or torus. There are several 
types of cycles that we can draw on this surface. For example, in Chapter 5 
we saw that we can draw two types of cycles on the doughnut that cannot be 
continuously shrunk to a point. However, there is also the cycle that is simply 
a closed line on the surface of the torus that can be continuously shrunk to a 
point. We want a method by which we can eliminate the second type of cycle. 

Let us define the set of boundaries B as the simplexes that can be written 
as the boundary of some higher simplex C: 


B= ac. (10.2.21) 
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Notice that the set of boundaries is a subset of the set of cycles: 
boundaries C cycles. (10.2.22) 


We say that two simplexes are in the same equivalence class if they differ only 
by a boundary. Thus, we want to extract the set of all inequivalent simplexes, 
and we do this by defining the pth homology group: 


p-cycles 


H,(M) = (102-23) 


p-boundaries 
This allows us to eliminate the redundant cycles on the torus that can be con- 
tinuously shrunk down to a point, keeping only the two independent cycles 
that encircle the torus. Thus, the concept of homology is a natural one. 


10.3. Cohomology and Homology 


What is the relationship between the homology and cohomology groups for a 
compact manifold? Because of the duality that exists between them, we can 
show that they have the same dimension. Let us define the Betti numbers to be 
the dimension of the H’s: 


b? = dim H?(M) = dim H,(). (10.3.1) 


Let c, bea set of cycles defined on a compact manifold M. Let w, be a set 
of closed forms defined on M. Then construct the matrix Q2 po: 


OG. Oo, = ‘| Wy. (10.3.2) 
The matrix Q is called the period matrix (see (5.6.31) and (5.11.8)), and it can 
be shown under quite general conditions that it is an invertible matrix. Thus, 
itis an N x N matrix with nonvanishing determinant. But if the period matrix 
is a square matrix in N dimensions, then the dimension of the closed forms w, 
is equal-to the dimension of the cycles c,, which proves that the Betti numbers 
are the same for the cohomology and homology groups. This is an extremely 
important result, because it means that we can use either local (cohomology) 
or global (homology) properties of a particular manifold to calculate its Betti 
numbers. 

It is also important to realize that the Betti numbers are topological num- 
bers, depending only on the overall topology of a manifold. Thus, any linear 
combination of the Betti numbers is also a topological number. In particular, 
the most important one is the Euler number: 


100=— >) ee (10.3.3) 
i=0 


414 10. Calabi~Yau Spaces and Orbifolds 


To understand the properties of these Betti numbers, we will have to intro- 
duce a few more operators, including the Laplacian. Let us also introduce the 
Hodge star operator, which converts a p-form into an (N — p)-form: 

#(dx"' A dx’? A.» A dx'?) 
= VB isin _ axle! - dxir? A ~-» A dx" (10.3.4) 
(n oar p)! Ept tl pt2° ln 
where &;j,.. is the totally antisymmetric tensor in NV dimensions. Notice that 
N- 
**@, =(-1)"" 0, 
Wp N¥Wg = Wg AN *Wp. (1073.5) 

In N-space, the product of a p-form and an (N — p)-form yields an N- 

form, which is proportional to the volume d” x. Thus, we can integrate over 


the product of a p and an (N — p) form to obtain an ordinary number. Let us 
now define yet another inner product: 


Chee [ eee (10.3.6) 


Now that we have introduced the definition of the scalar product, this allows 
us to define the dual 6 of the derivative d: 


(@p | dBp-1) = (Sap | Bp-1). (10.3.7) 
Explicitly, the adjoint of d is given by 
SS (—1) a eds. (10.3.8) 
We also have 
jo 10: (10.3.9) 


Notice that the adjoint reduces by | the degree of a differential form, while d 
raises it. 
Let us now define the Laplacian as 


A =(d+6y =8d+4+d5. (10.3.10) 
Let us define a p-form w to be harmonic if 
harmonic: Aw = 0. (10.3.11) 
Let us define a p-form to be coclosed if 
coclosed: 6a=0. LORS EDS) 
Let us define a p-form to be coexact if it can be written as 
coexact: w= da (10.3.13) 


for some (p + 1)-form a. 
We now arrive at an important theorem: 
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Theorem (Hodge). On a compact manifold without boundary, any p-form 
can be uniquely decomposed as the sum of exact, coexact, and harmonic forms, 
Le, 

Wp = Ady) + OBy41 + Vp, (10.3.14) 
where y, is a harmonic form. 

This is a powerful result, because we can show that each cohomology class 
contains only one harmonic form. To see this, first construct the inner product 
between a form and its Laplacian: 

(w | Aw) = |6a|* + |da|’. (10.3.15) 


Thus, the statement that a form is harmonic (which sets (w | Aw) = 0) 
is equivalent to stating that it is exact and coexact (because |Sw|? = 0 and 
|dw|? = 0). In fact, it can be shown that 


ho i 6) =a (10.3.16) 

But if dw = 0, this means that dB = 0, and so 6B = 0. Therefore (10.3.14) 
reduces to 

w=daty. (10:3217) 


Thus, every cohomology class contains one harmonic representative. 

The fact that there is one harmonic representative within each equivalence 
class of exact forms gives us an alternative way to define the Betti numbers. 
We can also say that the Betti numbers count how many independent harmonic 
forms there are on the manifold. We have the following equivalent description 
of the Betti numbers: 


number of independent closed forms, 
Betti number = { number of independent cycles, (10.3.18) 
number of harmonic forms. 
Thus, we are free to use either of three equivalent formalisms to calculate the 
Betti numbers. 

The last formulation of Betti numbers, in terms of independent harmonic 
forms, gives us an alternative definition. The set of harmonic forms can be 
reexpressed as the kernel of the Laplacian (1.c., the forms that are sent to zero). 
Thus, we can define the Betti numbers as 

dim ker A,, 
Dy = 4 ding, (1053219) 
dim H?. 

We will find it useful to study the properties of some of these Betti numbers. 

First, we can always define the scalar product for an N-dimensional manifold: 


(wy | @v—p) = i WpOn-p- (10.3.20) 
M 
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Consequently, these two spaces contain the same number of independent 
elements. Thus, 


b, = dim H? = by_, = dim Hy_p. (10.3.21) 


This is called Poincaré duality. We will usually take bp = 1, so therefore 
el 

Another way to prove Poincaré duality is to notice that if a particular form 
is harmonic, then it has a dual that is also harmonic: 


Aw=0 —> Axw=0. (homer) 


But because the number of independent harmonic forms is equal to the Betti 
number, and because the Hodge star operation converts p-forms into (VN — 
p)-forms, we must have Poincaré duality. 

Lastly, if we have a product manifold, then the Euler number of the product 
manifold is the product of the Euler numbers of each manifold: 


x(M x N) = x(M) x X(N). (10.3.23) 
If we write this out in terms of Betti numbers given in (10.3.3), this becomes 
by(M x N)= >> b,(M)b,(N). (10.3.24) 

pt+q=k 


These are called the Kunneth formulas for product manifolds. 
Let us now take a few simple surfaces and calculate their Betti numbers. 


(1) Two-Torus 


A two-torus can be cut up into cycles in two independent ways. The dimension 
of the independent one-cycles that we can draw on a torus is thus two. Thus, 
b, = 2. Also, by Poincaré duality, we have 


Ty: | ee | (10.3.25) 
by =e. 
Thus, the Euler number (10.3.3) for a torus is 
x(TIh) =1-24+1=0. (10.3.26) 


(2) Riemann Surface 


As we saw in Chapter 5, there are 2 ¢ ways in which we can divide up a Riemann 
surface of genus g into independent cycles. Each hole or handle has two such 
cycles. Thus, b; = 2g. We then have 


bo Do =1, 
b; = 2k 
Thus, the Euler number for a closed Riemann surface is 


x(M) Siete 1 = 2 — 2p. (10.3.28) 


(10.3227) 
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(3) N-Sphere 


Cycles on the sphere Sy, of course, can always be collapsed down to a point. 
Hence there are no independent cycles that we can write. Thus, 


Dy = Oy =i, 
Sy: 10.3.2 
ss b, =0 otherwise . oe 
The Euler number is therefore 
0 if N is odd 
Sy) = oy 
x(Sw) 2 if N is even. ot) 
In particular, given that 
(ee ate (10:3.31) 


we can use the product rule on Euler numbers to show that the Euler number 
for a torus is 


Ne — (SiO (10.3.32) 


which agrees with our earlier calculation. 


(4) Product of Spheres 
Using (10.3.24), we can show that the product $3 x S3 has 
bo = bg = 1, 
S3 x S3: b; = 2, (103,33) 


bp, =0 otherwise . 


We find that the Euler number is zero, as expected from the fact that the Euler 
number of the three-sphere is also zero: 


x(S3 x S3) = x(S3) x xX(S3) = 0. (10.334) 

Similarly, we can take the product 52 x S» x S), which has the Betti numbers 
. bo = bs = I, 

So x S> x 52: b> = bs => Be (10:3.35) 


b, =0_ otherwise. 


We find its Euler number to be 
- x(S2 x Sp x So) = x(S2)’ = 8. (10.3.36) 


(5) Four-Torus 
We can write the four-torus as 


i T x Th. (103:37) 
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It is a simple exercise to show that the Kunneth relations for product manifolds 
in (10.3.24) yield 


plans 
Te =e (10.3.38) 
bi6. 


Putting these Betti numbers into the Euler number, we find 
x(Ts) = x(T2) x(a) = 9, (10.3.39) 


as expected. 


(6) Six-Torus 
For the six-torus Tg, we have 
T6 — Th x Th x Tr. (10.3.40) 


Therefore we can use the Kunneth relations to solve for the Betti numbers of 
the product manifold. It is easy to show that 


i abe = 1, 

t: ee (10.3.41) 
b= by = 
bx== 20. 


We see that the Euler number is equal to 0: 
au eye (10.3.42) 


This is also true because each circle within 7, has Euler number 0. Using these 
simple rules, we can obviously generate the Betti numbers of 7\.. 


(7) Real and Complex Projective Space 


The space C Py is formed by taking ordinary complex (NV) + 1)-space and 
making the identification 


Zp = AZp (10.3.43) 


for some nonvanishing complex number 4. This last condition reduces complex 
(N + 1)-space to complex projective N-space. C Py is a generalization of real 
projective space Py, which ts created by identifving points in real (NV + 1)-space 
by 


Spl kX» (10.3.44) 


for some nonzero real &. We can also construct it by taking the sphere Sy and 
identifying antipodal points. Some examples of real projective and complex 
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projective space are 


CPi = oa 
Py = SOC) = SUC)/Z2, (10.3.45) 
CPy = SU(N + 1) 
U(N) 
Their Betti numbers are given by 
bg =a 
lene by =0(N even), = 1(N odd), (10.3.46) 
b; = 0 otherwise, 
x(Pv) = 1(N even), =0(N odd), 
and 
Deven = 1, 
CPx: | ee 
Therefore, 
x(CPy)=N +1. (10.3.47) 


10.4. Kahler Manifolds 


So far, our discussion has revolved mainly around real manifolds. However, 
our results generalize quite easily to complex manifolds. 

To define an N-dimensional complex manifold, we want a 2N real manifold 
and a generalization of the usual definition of a complex number: z = x + /y, 


where i? = —1. If we have a 2N-dimensional real space, what we want is a 
2N x 2N tensor field J/ that can replace i, such that 
P=-l (10.4.1) 


as a matrix question, so that we can define a complex number as 
Teed Y: (10.4.2) 


Manifolds that have such a tensor field J are called a/most complex. 
However, we want more than just the existence of the tensor field J. We 
would like to diagonalize it. If we use complex coordinates 


Za = Xg + Dyas 
ae = Xq — L¥z, (10.4.3) 


“we would like to make a change of coordinates such that 
Ups fi (10.4.4) 
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The question is: Can we always diagonalize J in a neighborhood of a point p 
using a coordinate transformation: 


t= 2a (10.4.5) 


(which is not a function of z)? Such a coordinate transformation is called 
holomorphic, which is the generalization of the concept of analytic. For dif- 
ferent points p, we of course will have to patch them together using different 
holomorphic transformations until we can cover the entire surface. 

If we can always find such holomorphic transformations that can diagonalize 
J around a neighborhood of any point p of the manifold, we say that the 
manifold has complex structure and is a complex manifold. 

Thus, we have the analogy 


One dimension JN dimensions 


i J = 18) (10.4.6) 
analytic holomorphic 
=a Ly Desai 


(Intuitively, the process of patching together neighborhoods where J 1s di- 
agonal resembles the “elevator problem” in general relativity. By a general 
coordinate transformation, we can always locally transform the Chnstoffel 
symbols to zero at any specific point, which corresponds to falling freely in 
an elevator without gravity. In general, we cannot globally transform all the 
Christoffel symbols away entirely, or else the space is flat, but we can at any 
point on the manifold enter the elevator frame.) 

At first, this definition might seem verbose for a rather intuitive concept. 
However, certain 2N-dimensional real manifolds can be shown to fail this 
criterion. For example, the spheres 52, are not complex manifolds (except for 
S», because S$; = CP,;). Thus, it is not at all obvious that a 2N-dimensional 
real manifold can be rewritten as an N-dimensional complex manifold. 

Now let us discuss differential forms on these complex manifolds. Let us 
complexify our basis differentials as 


dz! = dx! +idy!. (10.4.7) 
Now the concept of a p-form can be generalized to a (p. q)-form: 
WO = Wijpuijbuig 2 Adz! -+-dz" \ dz Adz! ««- dz", (10.4.8) 


with p unbarred and qg barred indices. We can now define two exterior 
derivatives: 


ie ee Por all i} su 
dw = Aol bik si Thi dc Adz --+ Kd, 
, 


2 Cae a 
Ow — aI AZ 0) jp j§- GAZ! Neer A dian (10.4.9) 
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The properties of these two derivatives are 


a — a = 0, 
09+ 00 =0, (10.4.10) 
d= 4(0+ 4). 


Every form now takes two indices (p, q) to label it. Thus, we can now define a 
new cohomology group. Instead of de Rahm cohomology based on d, we now 
have Dolbeault cohomology, based on d. We can define closed and exact for a 
(p, q)-form just as before, except we now use @ instead of d: 
FFL 1 aca (10.4.11) 
d exact (p, g)-form 


As before, we can also construct the adjoint operator and the Laplacian. In the 
complex case, we actually have two Laplacians: 


A, = 00' + a°9, 

A; = 0a‘ + fa. (10.4.12) 
We now have the complex version of the Hodge theorem: 
Theorem (Hodge). Every complex (p, q)-form has a unique decomposition 

o=a+dp+d'ty, (10.4.13) 
where a is a harmonic form 
Ana = 0 

and B, y are (p,q — 1)- and (p, q + 1)-forms, respectively. 


We also have the generalization of the Betti numbers, which are now given 
by 


dim H}7(M) = h?4, (10.4.14) 
The relation to the Betti numbers is 
- nee ee (10.4.15) 
p+q=n 


We also have, by complex conjugation, 
ped — Ar 


and Poincaré duality 
ee fe (10.4.16) 


We now define a Kahler manifold. If we have a complex manifold with 
Hermitian metric g;;, then we can always construct the form 


Q = gjdz' Adz’. Gang) 
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This is called the Kahler form. A complex manifold is called Kahler if 
Ch Mm Uf (10.4.18) 


A Kahler manifold has a Kahler form that is closed. 

A Kahler manifold, because of the above definition, has a large number 
of beautiful properties which make it among the most elegant of complex 
manifolds. We simply list some of its properties: 


(1) We can show that the Hermitian metric of a Kahler manifold can be written 
in terms of a derivative of a single function, the Kahler potential ¢, 


=. 10.4.19 

815 = Bziagi ( ) 

(2) It is easy to show that a Kahler manifold satisfies, by explicit calculation, 
207 = Ap =e (10.4.20) 


This is a powerful identity because it states that the various Laplacians 
that we can form on a Kahler manifold are all the same. Thus, there is no 
confusion when using different Laplacians for the Kahler manifold. 

(3) If a manifold is to admit a Kahler metric, then the even components of 
the Betti numbers must be greater than or equal to one, and its odd Betti 
numbers must be even: 

Kahler > ea (10.4.21) 
2n+1 = ven, 
for integers p and n. (This simple criterion rules out a large class of mani- 
folds. For example, this rules out 53 x 5; as a manifold that admits a Kahler 
metric, because b, = 0. It also rules out Py but allows C Py.) 

(4) If the J) tensor is covariantly constant, then the metric is Kahler. The 

converse is also true for complex manifolds. 


V° J? = 0 < Hermitian metric is Kahler. (10.4.22) 


(5) Ifthe torsion form that we construct on a complex manifold vanishes, then 
the metric is Kahler. 
(6) The only nonvanishing Christoffel symbol of a Kahler manifold is 


ip Ooeeaeh- (10.4.23) 
The only nonvanishing component of the curvature tensor is 
Rive = Merc (10.4.24) 


and other curvature components related by symmetry and complex 
conjugation. The contracted Ricci tensor becomes 


d7(In det g) 
az2az> 


in) = 


(10.4.25) 
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The two-form 
R = Rj dz" dz” (10.4.26) 


is closed: dR = 0. 

As a direct consequence, we can show that Kahler manifolds have U(N) 
holonomy. In fact, this can be used as an alternative definition of Kahler 
manifolds. Notice that the holonomy group is the rotation group generated 
by making closed paths around the manifold, which is a function of 


(7 


— 


[Dz, Dp] ~ RY,Mi;, (10.4.27) 


where M;; is the rotation matrix for some 2N-dimensional tangent space 
group. The coefficients of this rotation, we see, are functions of the anti- 
symmetric Riemann tensor R,»,~, which can now be viewed as an element 
ina 2N x 2N antisymmetric rotation matrix. Thus, in general, we have 
SO(2N) holonomy for a general manifold. However, if the manifold is 
Kahler, we can show that the restriction on the curvature tensor we found 
earlier reduces the SO(2N) rotation matrix down to U(N), which is a 
subgroup of SO(2N). The restriction that (10.4.24) is the only nonzero 
component of the curvature tensor breaks SO(2N) symmetry. 

(8) Kahler manifolds, which have U(\) holonomy, can be further restricted if 
we demand that they have vanishing first Chern class. In this case, U(V) 
holonomy reduces to SU(N) holonomy. In fact, we have the theorem of 
Calabi-Yau, which works for all NV, that a Kahler manifold of vanishing 
first Chern class always admits a Kahler metric of SU(N) holonomy. In 
fact, it can be shown that the vanishing of the first Chern class is equiv- 
alent to having a Ricci-flat metric. Thus, we will use these two concepts 
interchangeably. 


It will now be instructive to consider explicit examples of Kahler manifolds. 


(1) Riemann Surface 


Any oriented Riemann surface admits a Kahler metric. Because the line 
element of any Riemann surface can be put in the form 


ds* = edz dz, (10.4.28) 
then it is Kahler because we can always find a Kahler potential @ such that 


——$ =e’. (10.4.29) 


Riemann surfaces are trivially Kahler because any two-form on a Riemann 
, surface, including the Kahler form, is closed. 
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(2) Complex N Space 


Complex N space Cy is trivially Kahler because its standard line element 


ds — laa (10.4.30) 


I 


can always be put into Kahler form. 


(3) Sphere 


We note that Sy, in general, does not admit a Kahler metric. This is because 
the even Betti numbers 5), for p equal to a nonzero integer are all usually 
equal to zero. Hence, they cannot be greater than or equal to one, which was 
one of the conditions on a Kahler manifold. The exception is the two-sphere 
S, which has no even (nonzero) Betti numbers. To show that the two-sphere 
admits a Kahler metric, notice that a two-sphere has the line element: 


dx? + dy’ 


2a 
From this, we can write the Kahler form 
dz A dz 
= (je 10.4.32 
2! (1 + 2z) ( ) 


if we put it into complex form. This Kahler form is exact, and hence the space 
admits a Kahler metric. Although S$ is Kahler, we can show that c; # 0. 
therefore it is not Ricci flat. 

We notice that S,, x S, is a complex manifold if p and qg are odd integers. 
However, this does not mean that they are Kahler. In fact, 8; x S; and S; x Ss 
are both complex manifolds, but they are not Kahler. However, $; x $3 x S3 
is Kahler (but is not Ricci flat). 


(4) Complex Projective N-Space 


To prove that C Py is Kahler, we note that it is possible to write the metric as 
a Fubini-Study metric on C Py: 


N 
Q = 419d In ( +r), (10.4.33) 
I 


(5) Complex Submanifolds of C Py 


It is trivial to see that complex submanifolds of C Py are also Kahler. In fact, we 
use precisely the metric defined for C Py for the submanifold, except we must 
be careful to take only the components of the metric tensor that are tangent 
to the submanifold. Since the original metric was Kahler, the metric on the 
submanifold (which is the same metric) must also be K4hler. 
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(6) Torus 


The two-torus 7) actually has vanishing first Chern class c; = 0. We can also 
show that the four-dimensional torus 7, is Kahler. Finally, we can show that 
the six-torus 7; is both Kahler and Ricci flat. Thus, compactification on the 
six-torus appears to have the desirable feature that N = | supersymmetry is 
preserved. However, the drawback of the six-torus is that too much symmetry 
is preserved. In fact, N = 4 supersymmetry is preserved on 7s, making it 
unacceptable from the point of view of phenomenology. 
Let us now summarize some of these results in a table: 


Manifold Complex Kahler R-Kahler x 
S> WW vi N 2 
Son—1 N N N 0 
Sen N N N 2 
(53)? VW N N 0 
AY x Ss se N N 0 
(5) BY: We N 8 
S| x Sant ny’ WY NY 0 
> We ny Ww 0 
Ts Yi nya W 0 
(CUR BY \¢ N 4 

C Py Y W a M+1 


where R-K&ahler means Ricci flat and Kahler, Y, (N) means yes (no), x equals 
tie Eeler number, andn = 12,375. 

In addition, the condition of being Ricci flat places even more restrictions 
on the Kahler manifold. For example, it can be shown that a Kahler manifold 
in three complex dimensions has c, = 0 if and only if there exists a covariantly 
constant nonvanishing holomorphic three-form w. This shows, for example, 
that S, x $> x S> does not admit a Ricci-flat Kahler metric. (We know that this 
manifold has b; = 0. Thus, by definition, there are no harmonic three-forms on 
this manifold. But this also means that there are no holomorphic three-forms, 
either. Thus, it cannot admit a Ricci-flat Kahler metric.) 

Another simple consequence of this is that any harmonic (p, 0) in three 
complex “dimensions can be multiplied by w, thus creating a harmonic 
(0, 3 — p)-form: 


co=0 > hh? = po3-?, (10.4.34) 


This identity is true because w is covariantly constant and hence acts like 
a constant under the Laplacian. Thus, a harmonic form remains a harmonic 
form after multiplication by w. Also, a (p, 0)-form becomes a (0, 3 — p)-form 
because we are contracting with the Hermitian metric tensor g,,; and not gap. 

This, in turn, allows us to eliminate almost all the various Hodge coefficients. 
By using the various reflection symmetries and the previous symmetry, we can 
show that only h!:!, h?:', A!° survive as independent components of a manifold 
with SU(3) holonomy. Of these, we can also eliminate h'° for the following 
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reason. We know that the Laplacian can always be expanded out as 
Aq = —V* + curvature terms. (10.4.35) 


Acting on a one-form, the various curvature tensors reduce to the Ricci tensor. 
But the Ricci tensor is zero on a Ricci-flat manifold. Thus, a harmonic one- 
form must also be covariantly constant. This means that the Betti number of 
a harmonic one-form must be zero: b; = 0. But it also means that hi = 0 
by (10.4.15). Putting everything together, we can now show, for a Ricci-flat 
manifold, 


f= Oly 2 ee (10.4.36) 
p.g=n 
This last identity for a Ricci-flat metric will become extremely important when 
we discuss the generation problem. It turns out that A'"' and h~"! are related to 
the number of positive- and negative-chirality fermions we can define on the 
manifold, so that the above equation states that the generation number is half 
the Euler number 


generation number = |x|. (10.4.37) 


Thus, we have a topological derivation of the generation number! 


10.5 Embedding the Spin Connection 


Armed with some of these elementary results from algebraic topology, Jet us 
now return to string phenomenology and apply these results. 

Earlier, we saw that the condition of N = | supersymmetry implied the 
existence of a covariantly constant spinor, which in turn implied that the six- 
dimensional manifold K was Kahler, Ricci flat, and had SU(3) holonomy: 


N = | supersymmetry — covariantly constant spinor 
— Kahler, Ricci flat, SU(3) holonomy. 


Now we wish to exploit the remaining condition, which is the Bianchi identity 
(TOa1E9): 


TrRAR=aTrF a F. (10.5.1) 


This is actually a strange identity, because we have the Riemann tensor on the 
left and the Yang—Mills tensor on the right. The equation used to be an exact 
identity, devoid of any content, but in the presence of our assumptions 


GG, (10.5.2) 


we find that the equations are actually quite difficult to satisfy. Thus, the 
preservation of this identity is nontrivial because the system is overconstrained. 

One attractive way to satisfy this odd identity is to set part of the Ex ® 
Ex gauge fields equal to the Riemann spin connections, which have SU(3) 
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holonomy. This produces a nontrivial link between the spin connection and 
the Yang-Mills gauge field. We can perform this embedding as follows on the 


gauge field: 
0 0 
A= ( alt (10333) 


where w is the spin connection, which occupies part of the gauge field matrix. 
Thus, to embed the spin connection field into the Yang—Mills gauge field, we 
need to find a subgroup of the gauge group that contains SU(3). This means, of 
course, that we are breaking the original gauge symmetry of the Yang—Mills 
field. The simplest decomposition is 


Ex ® Ex > SU(3) @ Es ® Es. (10.5.4) 


We must check, however, that the Clebsch—Gordon coefficients are such that 
the Bianchi identity is satisfied exactly. In particular, we must show that we 
can obtain the factor of 4 in the Bianchi identity (10.1.9). 

We know that the Yang—Mills gauge fields are in the adjoint representation 
of Eg, which has 248 elements. We must now find a breakdown of these 248 
elements into representations of SU(3) ® E¢, which is always possible to do. 
We find 


248 = (3, 27) @ (3, 27) @ (8, 1) @ (1, 78). (10.5.5) 


To see if (10.5.1) is satisfied, let us convert from the adjoint representation 
of SU(3) to the 3 @ 3. If A is a generator of SU(3), then we wish to find the 
relationship between the trace of this matrix squared in the 8 representation 
and the 3 @ 3 representation. The answer is 


Tn(ag) = 3 Its 5). (10.5.6) 


Let us now focus on the SU(3) content of (10.5.5). Notice that we have 27 sets 
of fields transforming in the 3 @ 3 representation of SU(3), and also one set 
of octets. But the sum of the trace of the squares of the octets, as we saw in 
(10.5.6), must be multiplied by 3 when we convert to the 3 © 3 representation 
of the SU(3) matrices. Thus, the total redundancy in the 3 @ 3 representation 
is equal to 


2) + 3x l= 30 


for a total factor of 30. Because of spin embedding, where the curvature tensor 
is defined in the same space as the Yang—Mills tensor, we can now satisfy 
(10.5.1) because the missing factor of 30 emerges when we count the specific 
representations of SU(3) in (10.5.5). 
This breakdown to SU(3) x Es is a good one for phenomenological reasons, 
because FE, has been extensively explored for GUT model building. 
The original Eg group, by contrast, does not have complex representations, 
which are needed to describe chiral fermions, but E, does. The 27 multiplet, in 
fact, is precisely the favored multiplet for fermions for model building with E,. 
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The group E, is also good from the point of view of low-energy supersymmetry, 
because the 27 of the fermions can form a supersymmetric multiplet with the 
27 of the Higgs. 

In summary, we have the phenomenologically acceptable conclusion 


rk — a Tr F A F > spin embedding — 27 fermions. (10.5.7) 


Because the fermions are now in the 27 representation, we can ask the 
question: How many generations are there? The GUT, as we saw, was plagued 
with the problem of generations. There was no reason for assuming more than 
one generation. 

In the superstring picture, the situation is precisely the opposite. We will 
now show that we get too many generations! 


10.6 Fermion Generations 


One of the most powerful applications of algebraic topology to the phe- 
nomenology of string theory is the calculation of the generation number purely 
from topological considerations. 

To calculate the number of generations predicted by the theory, we must first 
calculate the number of massless particles. The 10-dimensional Klein—Gordon 
operator for the particles in question becomes, after compactification, the sum 
of two Klein—Gordon operators in four and six dimensions: 


Ohov = (O,+ O6)v = 0. (10.6.1) 

In general, LO, will have eigenvalues denoted by m-, that is, OoW, = mm? Wm, 
so that our wave equation becomes 

(C4 + m7) = 0. (10.6.2) 


We are interested in the massless sector in four dimensions, so we want to keep 
only the zero eigenvalues of the LJ, operator. Thus, 


Chy = Oy =0. (10.6.3) 


This is an important equation because it has two interpretations. First, it means, 
of course, that the four-dimensional fermions are massless. Second, it also 
means that y is a harmonic form in six dimensions. Therefore the number of 
massless modes in four dimensions will be related to the number of harmonic 
forms that we can write for the six dimensions manifold. We saw earlier in 
(10.3.18) that the number of harmonic forms of degree p is equal to the pth 
Betti number. Thus, topological arguments alone should give us the number 
of generations! In summary: 


, y is harmonic in six dimensions, 
m’=0Q-> ; 
y is massless in four dimensions. 
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We expect that the number of generations is a topological number because 
of the Dirac index theorem. We know that the solutions of the Dirac equation 
can, in general, have zero modes: 


i(D)W = 0. (10.6.4) 


In fact, the index of this operator is equal to the difference between the positive 
and negative chiralities of the zero modes: 


Index(P) =n, —n_. (10.6.5) 


The Dirac index is a topological quantity defined on a spin manifold, so we 
expect that it can be related to the characteristic classes we found earlier. In 
fact, when discussing SU(3) holonomy, we will find that 


Index(P) = }|x(M)|. (10.6.6) 


But the Dirac index is also equal to the generation number, because we will be 
considering only fermions of one specific chirality. Thus, the precise relation 
between the generation number and the Dirac index, or the Euler number, 
is 


generation number = $|x(M)|. (10.6.7) 


To see this, consider a manifold of SU(3) holonomy, with Betti (Hodge) 
numbers that have two indices p, q. The Euler number can be written as 


3 
x(M) = > (1) hg. (10.6.8) 


P.4 
The multiplicities of the supergravity and Yang—Mills multiplet can be deter- 
mined by calculating the number of harmonic forms we can write down, which 
in turn is related to the Betti numbers. We find, if we compare the helicity of 
supersymmetric pairs with their multiplicity, 
(2, 3) hoo, 
(3, 1) hot, 
(1, 5 (2h1.0 + ho,1), 
(5,0) (hoo + 11 + h2,1). 


(10.6.9) 


If we analyze the spin-+ fermion sector, given by G. 0), we find that their 
multiplicity number equals 


fermion multiplicity = hoo + 44,1; + ho.. (10.6.10) 


This multiplicity, however, is too large. We want only the subset of this figure 
corresponding to the 27 and 27 of the spin-+ fermions. 

Before, in (10.5.5), we found that the 248 of Ex ® Es can be decomposed 
into (27, 3) and (27, 3). It can be shown that the multiplicity associated with 
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each of these two representations is equal to 
(27,3) hoi, 
(27,3) hy.. (10.6.11) 
Thus, the generation number is equal to 
#(27) — #(27) = hoy — haa, (10.6.12) 


where # represents the multiplicity of the 27 representation. But this number, 
in turn, is precisely half the absolute value of the Euler number, as we saw in 
(10.4.36). 

The relation between the generation number and the Euler number is rather 
surprising because there is no reason to believe that any relation should exist 
between the two. The generation number is a function of the Yang—Mills gauge 
group Es ® Eg, while the Euler number is a function of the manifold Kg. 
Naively, we do not expect the two to be correlated. But the relationship between 
the two is established because we performed spin embedding, which breaks the 
gauge group and produces an intimate relationship between the group manifold 
K« and the gauge group. Thus, the essence of this important result is a direct 
result of compactification and spin embedding. 

Next, we would like to calculate the Euler number, and hence the number 
of generations, for a series of Calabi~Yau metrics. 

We showed earlier that a submanifold of the Kahler metric on C P,;. because 
it has the same Hermitian metric g;;, is also Kahler. Thus, we want to consider 
the set of submanifolds of C Py that have vanishing first Chern class c, = 0. 
This can be accomplished by placing constraints on the z's by setting certain 
polynomials to zero. 

Consider the submanifold of C P, obtained by the constraint 


5 
ee 0 (10.6.13) 
n=) 

It can be shown that this manifold has vanishing first Chern class and that 

x(M) is equal to —200. 

Let 


Y Ned) do s.noshy) (10.6.14) 


represent the submanifold obtained by setting k homogeneous polynomials of 
degrees d; to zero within C Py. Fortunately, the formula for the total Chern 
class, not just the first Chern class, of these submanifolds is known. For SU(3) 
holonomy, the total Chern class equals 

Gira. (10:6: 13) 

OS —a 0, 
ana a 

where J is a certain two-form obtained by normalizing the Kahler form of 
CP, ,3. By expanding this formula and then setting the first Chern class c; 
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equal to zero, we find 


De) Sea (10.6.16) 


i=l 
Thus, we find only five possibilities with vanishing first Chern class: 
x(Y%a;5)) = —200, 
X(%5:2,4)) = —176, 
x(%53.3)) = —144, (10.6.17) 
X(%6,3,2,2) = — 144, 
X (¥Y(7:2,2,2,2)) = —128. 


Thus, the generation numbers are unacceptably large! This is phenomenolog- 
ically undesirable because we know from arguments from nucleosynthesis, 
cosmology, and asymptotic freedom in QCD that we want very few 
generations, such as three or four. 

Fortunately, we can still reduce the generation number and make it much 
smaller by considering nonsimply connected manifolds. Let us divide the orig- 
inal manifold Mo by a discrete symmetry group G that acts freely on the 
manifold (i.e., no fixed points), yielding a manifold M. If the number of dis- 
crete generators of G is n(G), then the Euler number of the original manifold 
divided by the discrete group G is 


— Xx(Mo) 
x(M) = “a(G)’ (10.6.18) 
where 
eh 
M = Ee (10.6.19) 


Let us consider the previous example of the submanifold of C P, with 


5 


7 = (10.6.20) 


Lol 
Notice that this polynomial is invariant under the following symmetry 
operations: 


(Pane ety Nano Grd pee emer oye 
(Z1,Z25 +++ 25) —> (Z1, %22, @7Z3,..., 0°25), (10.6.21) 


where aq is a fifth root of unity. The discrete symmetry group Is 


Z; x Zs (10.6.22) 
which has 25 generators. Thus, the Euler number of this new manifold is 
—200 
5 (10.6.23) 


25 
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which predicts four generations. 
Many other types of models are possible with reasonably “low” generation 


number: 
Y(5:3,3 
x ( ) 
23 x Z3 


Y(- 
5 (7;2,2.2,2) ) Se 
2 x 2) x 2) 


. (“a2 EtG (10.6.24) 


= —40. 


The point here is not that we have obtained the correct phenomenology. The 
point is that with a clever choice of a discrete group, it is reasonable that we 
might be able to write a model with an acceptably low number of generations. 
We must stress that the very existence of chiral fermions in superstring theory 
is something of a miracle. In standard Kaluza—Klein theories, for example, 
there are serious obstructions to constructing a theory with chiral fermions in 
four dimensions. In fact, an acceptable supersymmetric Kaluza—Klein model 
with chiral fermions does not exist. Thus, it is quite remarkable that we can 
obtain any chiral fermions at all with superstrings. 

Lastly, the choice of a nonsimply connected manifold may at first be 
surprising, but it turns out to have other phenomenologically acceptable 
features. 


10.7. Wilson Lines 


Our previous discussion of embedding the spin connection into the gauge field 
in (10.5.3) broke the group Eg down to E, © SU(3), which has exceptionally 
good phenomenology because the fermions are in the 27 representation of Ey. 
Our next step is to break E, further to obtain the minimal group SU(3) ® 
SU(2) @ U(1). 

The problem here is that most naive methods of breaking E, down further 
will necessarily break N = 1 supersymmetry as well. However, there is one 
trick that we can still use on manifolds that are not simply connected. 

In general, we know that the path-ordered product of elements of a gauge 
group around a loop is gauge-invariant. We can express this as the Wilson loop: 


OF SPIERD (f a.dx"), (10.7.1) 


where P represents the ordering of each term with respect to the closed path 
C. For small paths, we find that U is proportional to the exponential of the 
curvature two-form. 
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Normally, when the curvature F wy Vanishes, we expect that the Wilson loop 
becomes unity. This is because we can always shrink the closed path C to a 
point. If the area tensor of the small closed path is given by A“”, then the 
Wilson loop becomes 


U —> efwa” (10.7.2) 


However, if the path is not simply connected, this argument fails. Thus, we 
can have vanishing curvature tensors, yet the Wilson loop does not have to be 
unity [1, 7]. 

This is ideal for our case, because we are now considering nonsimply con- 
nected manifolds. JfU does not equal |, then the group E. breaks down to the 
subgroup that commutes with U. 

Notice that E contains a maximal subgroup: 


SU(3), ® SU(3), ® SU(3)z, (10.7.3) 


where C is the strong color group and L and R represent left and right weak 
interactions. 

This breaking can be accomplished by choosing one element, Up of E¢, that 
satisfies 


Ur =1. (10.7.4) 


This element generates the permutation group Z,,. We now want the subgroup 
of Es that commutes with Up. In general, this will be a group of rank six. We 
can, for example, choose a specific element of E¢ to be 


p oO O y 0 0 
U=@yx| 0 B 0 Ix] Os 0 |, (10.7.5) 
eo 67 (BOW es 


where the matrices represent the elements of the three SU(3)’s within E¢, and 
a, B, y, 6 are all nth roots of unity. We then have the following breakings for 
various choices of these elements: 
o=1; yde=1-—> SU(3)c ® SU(2), ® U(1) @ U1) @ UC), 
y = 5 + SU(3)c @ SU(2), ® SU(2)rz ® U(1) @ UCI). (10.7.6) 
By selecting out different elements of E,, we can get different groups. For 
example, 


es = SU(3)c ® SU(2), ® SU(2)z ® UC!) ® U(1). (10.7.7) 


VE A. 
The explicit form of Up is 
a 0 0 ‘amen | aa 
U=()x] 0 a@& 0 0 pe O (10.7.8) 


670 aa 0 50° pee 
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Yet another choice is to fix the discrete group to be Z3, in which case 


= = SU(3)c ® SU(3) ® SU(3). (10.7.9) 


3 

In general, these solutions that contain the minimal model also contain other 
gauge interactions. In fact, there are 27 subgroups of E, that can yield at least 
the minimal model, and all of these have unwanted U(1) gauge groups that 
survive the breaking. These, of course, must be further eliminated, probably 
through another mechanism such as exploiting any “flat” directions in the 
superpotential. 


10.8 Orbifolds 


Although there are still problems with the Calabi-Yau compactification 
method, it is rich enough, in principle, to provide qualitatively the mechanisms 
for breaking the gauge group down to the minimal theory. 

In practice, however, Calabi-Yau manifolds are quite difficult to construct 
and only a few of them are actually known. We would like to have simpler. 
flat-space solutions to investigate. Unfortunately, the simplest toroidal com- 
pactification is unacceptable phenomenologically for, among other reasons, 
N = 4 supersymmetry survives after breaking. If we start with N = 1 su- 
persymmetry in 10 dimensions and compactify down to four dimensions, we 
wind up with N = 4 supersymmetry. (If we start with N = | symmetry in 
10 dimensions, the 16 supersymmetry generators Q“ after compactification 
become Q'*, where f is a spinor in four space-time dimensions and 7 ranges 
from | to 4. Thus, compactification of NV = 1 supersymmetry down toa smaller 
number of dimensions always leads to an extra O(1) symmetry. ) 

However, by constraining the toroidal compactification in a much more 
rigid fashion, we should be able to reduce N = 4 supersymmetry to N = 
1 supersymmetry. The proposal of Dixon, Harvey, Vata, and Witten was to 
compactify on an orbifold [8, 9] obtained by taking a manifold and dividing 
out by a discrete group, such that there are fixed points (1.e., points that do not 
change under the transformation). An orbifold, because of the singularities at 
these fixed points, is not a manifold, but apparently strings can propagate on 
these orbifolds without difficulty. The advantage of orbifolds is that they are 
flat, can break N = 4 supersymmetry, can product chiral fermions, and are 
easy to generate. 

The simplest orbifold is a cone. Simply take the complex plane and make 
the following identification: 


Zz = et tllnz (10.8.1) 


for some integer n. This divides the complex plane into 7 equivalent triangular 
sectors. Notice that this identification divides up the plane with the discrete 
symmetry group Z,,. 
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Notice that the origin is a fixed point under this transformation; i.e., the 
origin maps into the origin under this rotation. Now if we were to slice up the 
complex plane, extract just one of these triangular sectors, and then wrap up 
this sector according to the above identification, we arrive at a cone. Thus, the 
cone or orbifold is nothing but a two-dimensional space divided by the action 
of the discrete group 

R2 : 
cone = orbifold = —. (10.8.2) 
Zn 
The origin, which is a fixed point, now becomes a potential singularity. Thus, 
a cone is not a manifold. If we follow a path around the origin, the total angle 
we traverse is not 360 degrees, but 360 degrees divided by n. 

Let us take another simple example. Let us start with the two-torus defined 

by making the identification 


Z=Zt+i. (10.8.3) 


This divides the complex plane into an infinite number of squares of width | 
whose edges are identified with each other. Now let us create an orbifold out 
of this two-torus by dividing by Z>, which is generated by reflections 


P(z) = —z. (10.8.4) 


If we now construct the surface 

T) 

P 
we have an orbifold. Notice that the reflections leave four points invariant, 
which are the fixed points: . 

0, ‘, si, s(1 +i). (10.8.6) 

Three of these fixed points lie on the edge of the unit square, while one lies 
within the square. 

What does this orbifold look like? Under the action of the reflections, the 
points within the square are further identified with each other, making the 
square divide up into smaller squares of width 1 Now imagine two smaller 
squares, each of width 4. Place them directly on top of each other. Now 
sew the four edges of faa two squares together, forming a closed surface. 
Topologically, it is the same as the surface of a square beanbag. 

Notice that this surface has four singularities, corresponding to the four 
vertices of the square. If we followed a path around each of these fixed points, 
the angular deficit would be 180 degrees. 

It is possible, however, to convert this orbifold back into a normal manifold 
and hence calculate its Euler number. Let us cut off each of the four fixed 
points of T>, divide by Z, and then resew back a small patch at each of these 
fixed points. This is called “blowing up” a singularity. In this case, by cutting 


(10.8.5) 
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out these holes, sewing the squares together to form the orbifold, and sewing 
back four patches, we obtain a manifold topologically equivalent to a sphere. 

The Euler number of a disk d is equal to 1 and the Euler number of the two- 
torus is 0. Therefore, the Euler number of the torus minus four fixed points is 
equal to 


x(T, — 4d) = x(h) — 4x(d) = —4. (10.8.7) 
If we divide by the action of Z>, then the Euler number of the resulting surface 


is 
, (4 = ~) ae (10.8.8) 
Z2 


Finally, by gluing four disks back onto the surface, we must add four back to 
the Euler number: 


0-4 
+44) Sara (10.8.9) 


T, — 4d 
ze 


This checks with our intuition, because this manifold is equivalent to the two- 
sphere $2, which has Euler number 2. 

Now, we would like to generalize the previous example by taking a more 
complicated compactified surface in six dimensions. Let us first compactify 
the complex plane by making the following identification: 


| = Zep le 


a (10.8.10) 


SoG 


The first transformation divides the complex plane into an infinite number of 
narrow vertical strips. The action of both transformations divides the complex 
plane into an infinite number of equilateral triangles. The “fundamental re- 
gion” T of this space consists of two of these equilateral triangles, back to 
back. 

This space, because it consists of an infinite number of equilateral triangles, 
is invariant under rotations by 120 degrees: 


Ze eg. (10.8.11) 


Thus, this space has Z; symmetry. Under this rotation by 120 degrees, there 
are three fixed points, or points that are left invariant: 


eit 6 Vert / 
a0. —. ——. (10.8.12) 
v3 V3 
Notice that the fundamental region T, which consists of two equilateral trian- 
gles, contains three fixed points, one at the origin and the other two within the 
two equilateral triangles. 
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Now let us perform the folding operation on this space T and obtain an 
orbifold: 


9 
am 
Again, Z is not a manifold because the three fixed points (two of which occur 
within the region T itself) are potentially singular. 


To generalize to the six-dimensional case, we might consider simply taking 
the trivial product of three of these complex spaces: 


Z=ZxZxZ (10.8.14) 


Z (10.8.13) 


which has 3 x 3 x 3 = 27 fixed points. 

This space has the great advantage that the N = 1 supersymmetry in 10 
dimensions can be broken down to N = 1| supersymmetry in four dimensions. 
If we compactify on the space: 


M* x Z, (10.8.15) 
we note that the O(10) group is broken down as follows: 
SO(10) > SO(4) ® SO(6). (10.8.16) 


Since SO(6) = SU(4), we can show that the original Q* of N = 1, D = 10 
supersymmetry is now broken down into four spinors transforming as the 
components of 4 under SU(4). If we compactify naively on T;, then SU(4) is 
not broken and each of the four supersymmetry generators in four dimensions 
transforming as 4 survives. However, if we start with an orbifold by dividing by 
Z3, we can associate the Z; as belonging to the SU(3) subgroup of SU(4). Thus, 
dividing by Z3 necessarily breaks SU(4) symmetry, since there are no three- 
dimensional representations of SU(4). But only one of the four components 
within 4 survives the division by Z3, and hence only N = | supersymmetry 
survives, which is fortunate. 

In summary, orbifolds can be used to break the overall symmetry of the 
theory because only symmetries that commute with the discrete group survive. 
This method can be used to break the gauge group as well. 

Let g be a specific element of the 10-dimensional gauge group such that 


pe" = 1 (10.8.17) 


for some integer m. Then we demand that the states of the theory are those that 
commute with the combined action of 


Fae (10.8.18) 


Thus, we have a mechanism for breaking both gauge and supersymmetry 
simultaneously. 

As an example, we might take g? = 1, which is an element of SU(3). 
Thus, demanding that the states of the theory be invariant under the combined 
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operation yields the breaking of Eg: 
E,;@E,; —-> Es @SU(3)®@ Ez. (10.8.19) 


Now let us calculate the Euler number on this surface and hence the number 
of generations. Once again, we must cut off the 27 fixed points from 7, then 
divide by Z3, and then sew back on 27 patches. 

The Euler number of the surface T is zero, so the manifold J minus the 27 
disks d located at the fixed points has Euler number 


x — da) = —27. (10.8.20) 


If we divide by £3, the Euler number becomes —9. Now we must sew back 27 
disks d. This time, however, each disk has SU(3) holonomy and Euler number 
3, so the total Euler number is 


. (=) 4.27y(d) = 72. (10.8.21) 
3 


Thus, we have 36 generations of fermions. 

The previous example was only a toy model in six dimensions. It is possible. 
however, to construct orbifold models that have much fewer generations, as 
low as two or four. 


10.9 Four-Dimensional Superstrings 


In model building using orbifold compactification to produce four dimensions. 
stringent constraints must be met, such as modular invariance and the Ly = Lo 
conditions. These conditions are nontrivial, because modular invariance mixes 
up the boundary conditions. 

For example, when studying modular invariance, we can compactify on the 
six-torus 7, divided by a discrete point group P = Z,, such that the (twisted) 
boundary conditions for the orbifold 7;/P are as follows: 


X(o, + 27, 02) = hX(oj, 0), 
X(o}, 0) + 2) = gX(o1, 09), (10.9.1) 


ve 


where A and g are elements of P of order n, that is, ge” = hh" = 1. For 
the usual bosonic or fermionic boundary conditions, we have g and /) equal 
to ++. However, when compactifying on orbifolds, this condition must be 
generalized. 

The presence of g and h, of course, can break the overall symmetry of the 
theory, both for space—time and for the internal group. The group that survives 
the compactification is the subgroup that is unaffected by this process, i.e., the 
subgroup that commutes with g and A. 

To study how the one-loop trace is affected by the compactification, let us 
diagonalize g and /f in terms of their eigenvalues, g.h — {e77'"'}, where 
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v; = r;/n so that they are of order n. For simplicity, we take the following 
periodicity condition: 


-X(o + 27) = e*"!"X(o). (10.9.2) 


Notice that the Fourier decomposition of the string modes is now shifted. 
In general, we must now expand the string in terms of a new set of 
modes 


MG) == ie he ee (10.9.3) 


The presence of the v within the Fourier modes changes the trace calculation 
of the one-loop amplitude in a subtle but important fashion, which will put 
nontrivial constraints on the v;. 

In the trace calculation, the presence of v enters in the zero point energy. For 
the usual bosonic string, for example, the unregularized Hamiltonian contains 
the factor ; Y,, A-n@n + Ana_y. This, of course, has infinite matrix elements 
and must be normal ordered. Technically speaking, normal ordering the cre- 
ation and annihilation operators creates an infinite zero-point energy given by 
5 >-,, n. This infinite zero-point energy can be handled in several ways (e.g., 
requiring the light cone theory to be Lorentz invariant), but it can be shown that 
this yields the same results as zeta function regularization. The zeta function 
is defined as 


2 
c() = =. (10.9.4) 
n=1 


This function is analytic in s, and we can analytically continue the zeta function 
to s = —1 and show that ¢(—1) = — 4. This is the desired answer, because 


¢(—1) is the analytic continuation of )°°~_, n. 
Now use zeta function regularization to compute the contribution coming 
from the shifted modes. We use the fact that 


l 
5 y | n= Y Cee —., + 41 — a). (10.9.5) 
neZ+a neZ 


The a(1 = a) factor is the new contribution to the zero-point energy in the trace 
calculation arising from the shifted modes of the orbifold. 

Now consider the heterotic string, where the trace over the single loop yields 
a factor containing the left- and right-moving energies: 


e2ti(tEL—tER) | (10.9.6) 

Under the transformation t + t +2, the expression remains invariant if we 
set 

n(E, oa Ep) ==) modal: (10.9.7) 


If we label the eigenvalues for the right-moving sector as e*"' and the two sets 
_ of left-moving eigenvalues as e77'"" and e*”'"" for the O(16) @ O(16) subgroup 
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of Ex @ Eg, then from (10.9.5) we have the following zero-point contribution 
to the ground state energy: 


I 
1 
pS . me vii — I+ 5+ (1 @ 2). (10.9.8) 


Since v = r/n, we can put everything together until we have the following 


constraint: 
ei = dort op (10.9.9) 
i J i} 


which is true mod n for odd n and mod 2n for even n. 

Now consider the effect of these constraints on the compactification dis- 
cussed earlier with the Z-orbifold, i.e., 7,/Z3, so that the point group P is of 
order 3. This means that 3v,; and 3v2; can be set to be lattice vectors on Eg. 

The constraints from modular invariance tell us that v; 5, should be ; times 
some integer. But since 3v,.2; is a lattice vector of Es, we can always fix v- to 
be 5 times some integer. Finally, we know that any point in eight-dimensional 
space containing the EF; lattice is within a distance of | from some lattice point. 
This means that we can always choose v; 5; < 1. 

Modular invariance is so rigid that there are only five solutions consis- 
tent with the constraint (10.9.9) on the various v’s. Each of the five sets of 
v’s breaks the symmetry group Eg ® Es down to the subgroup that com- 
mutes with the g and A twist factors. It is not hard, given the explicit form 
for the v’s, to calculate what this subgroup is. Because 3v is equal to a 
lattice vector, the subgroup that survives after symmetry breaking is the 
group that commutes with these lattice vectors. We simply quote the five 
solutions [8, 9]: 


(1) 


ate Soe (10.9.10) 
i= 


The solution has no chiral fermions and hence is unphysical. 


(2) 


~ 
=N 

II 
wih 


ee, = 0. (10.9.11) 
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This leaves us with the group Eg ® SU(3) @ Es. 
(3) 


8 
eS (10.9.12) 


This leaves us with the group E; ® U(1) @ SO(14) @ U(1). 
(4) 


8 8 
2: gee DP 
» Vij = » U5, =a 30 
fi i=] 
= 7p 
Uy; = 035 = a ree “ye (10.9.13) 


This yields E, @ SU(3) @ Es @ SU(3). 
(5) 


8 
es (10.9.14) 
i=] 
1 ! 
Vvyie= 3? i i, 3? z, eae 
U2; (a a.:). 


This leaves the symmetry group SU(9) ® SO(14) @ U(1). (The symbol... 
represents a Series of zeros.) 


The five groups we have constructed so far with orbifolds do not resemble 
the Standard Model at all. The gauge groups are still too large, and there are 
too many generations. Models of this type often have 27 generations because 
we can always construct string fields that are twisted around the 27 fixed 
points of the orbifold, yielding a redundancy of 27. However, we can further 
reduce the gauge group and control the number of generations by postulating 
the existence of background gauge fields (Wilson lines) that correspond to 
noncontractible loops on the torus. This can, in principle, yield models with 
only three generations and the gauge group SU(3) @ SU(2) ® U(1)". As in 
the previous discussion, the gauge group that survives will be the subgroup 
of Ex ® Eg that commutes with the Wilson line. In the case of orbifolds with 
Wilson lines, the gauge group that survives is the subgroup of Es @ Eg that 
commutes with P = Z; as well as the Wilson line. 
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For example, define the Wilson integral parametrized by a/: 
[a ax 2 A ee a, (10.9.15) 


where i = 1—6and/ = 1 — l6and e! define the lattice of the six-torus. Thus, 
we have now introduced a new vector a/ in our constraints that will allow us 
to construct new solutions. We can show that 3a/ and 3v’ are equal to a lattice 
vector. 

Under a modular transformation t — —1/t, the Eg ® Ex lattice vectors 
transform as 


p' > p' +0! +n,a/, (10.9.16) 


where n; = 0, £1. Different values of n; will yield different twisted sectors. 
Repeating all the arguments we used earlier, we can also show that the modular 
transformation t > t +3 results in the constraint [10, 11] 


3(v’ + na}? = 2m (10.9.17) 


for some integer m. This is the desired constraint emerging from modular 
invariance, which is much less restrictive that (10.9.9). 

An extraordinarily large number of solutions are now possible [10, 11] be- 
cause of the presence of the a/ term. However, let us select one such possibility 
that yields several desirable features, such as three generations. We postulate 
the following set of values for the v/ and a’: 


Piirizolay 

- B03 9) 30s eae 

a (10.9.18) 
38092 Ss al Sen 

and 

ee 0: 0, 0), 

af =} ae aah) (10.9.19) 
(0, 0, 0, 0, 0, 3, 0, 0), 
0,0, 0, . 0,0, 052); 

ay = (10.9.20) 
4.4.4, =, 0, 0, Oye) 


We can check that in the untwisted sector, the subgroup of Ey that commutes 
with the action of the Wilson line is given by 


[SU(3) ® SU(2) @ U(1)°], (10.9.21) 


while the other Eg is broken down to SU(2) @ SU(2) @ U(1)°. 

Furthermore, we should note that the number of generations has been re- 
duced to three. (This is because the 27 fixed points can be further divided into 
three sectors of nine fixed points if there is a Wilson line in one of the three 
complex dimensions. Adding two Wilson lines creates nine sectors of three 
fixed points. A careful choice of the v’ anda! will kill all but one sector, leav- 
ing us with three generations.) As a further bonus, this model has the desirable 
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property that extra colored triplets, which might mediate fast proton decay, are 
absent. 

The limitation of this model, however, is that there are too many U(1) factors, 
which is bad for phenomenology. In fact, the Wilson line technique does not 
reduce the rank of the group (because we are dividing by a discrete group), 
so we still have gauge groups of rank 8, which is too large. These extra U(1) 
factors, moreover, must be checked to see if they are anomalous. 

So far, none of the solutions obtained with orbifolds [12-25] (or Calabi— 
Yau manifolds) has exactly the desired low-energy structure. There are still 
problems getting the right generation number, the right gauge group at low 
energies, acceptable values for proton decay, etc. The point, however, is that 
these methods of compactification gives us the ability to construct potentially 
thousands of solutions that may be compatible with modular invariance. 

What is required, of course, is a systematic way to compute all four- 
dimensional physically relevant solutions to the string equations. So far, the 
bulk of the work has been to postulate a particular compactification scheme 
and then check for consistency with modular invariance. Recently, there has 
been considerable work in precisely the other direction, i.e., starting with 
modular invariance from the beginning and then searching for all possible 
compactification schemes compatible with it. 

This new program at the outset demands the following set of criteria for 
any physically relevant model; namely, the model must be: (a) tachyon-free, 
(b) anomaly-free, (c) modular-invariant, (d) supersymmetric, and (e) four- 
dimensional. So far, there are only preliminary results toward this ambitious 
program, but the early results have been encouraging. 

What we want is a way to calculate all coefficients C within the one-loop 
amplitude (5.9.6) for each spin structure. For example, in Sections 5.9 and 5.11 
we analyzed the one-loop and multiloop spin structures and how they changed 
under modular transformations. Now we want a systematic study of the effect of 
modular invariance on the multiloop spin structures [26-3 1]. Let us begin with 
the expansion of the string amplitude in terms of all possible spin structures: 


: a=de[s |4[o |: (10.9.2) 


where a and b label various spin structures over the a and b cycles, each element 
within a is equal to either 0 or I, C is a coefficient that must be determined, 
and the A’s represent the amplitude for each spin structure. For each fermion, 
there are 2°8 different spin structures on a Riemann surface of genus g. 

Let us rewrite the equations of Sections 5.9 and 5.11 in a more systematic 
fashion. For example, the theta functions must satisfy (5.11.5) when we make 
an Sp(2g, Z) transformation on them. Demanding modular invariance at the 
one-loop level means that C must satisfy the change in boundary conditions 
, given by (5.9.7) and (5.11.17). In this new notation, we can rewrite these 
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constraints 
a] __fin/8) Nay * 10.9.23 
al l= Cc geceael: ( ) 
and 
a ? b 
c| = dnnEernc | | : (10.9.24) 
a 


where the sum )~ is taken over f fermions (taking left movers minus right 
movers), each ay takes on value 0 or 1, and vector sums are only mod 2. 

When we demand modular invariance at the multiloop level, we must place 
further restrictions on the C’s. For example, unitarity requires the ability to 
factorize the amplitude describing a genus g surface into the product of genus 
1 tori; i.e., by (5.1.7) we should be able to slice up a genus g surface into 
different genus | surfaces by inserting a complete set of intermediate states. 
In our new notation, this means 


a! a® a! a’ 
el Me i es ee) arg (10.9.25) 


As we saw in Chapter 5, Dehn twists can mix up the (a, b) cycles on a genus 
g surface, so we must also demand (at least at the two-loop level): 


1 (it /4) > ara’ a a’ = a a’ 
65 é€ Ole ela. acle a ale a te 
(10.9.26) 


where 5 equals + 1 (— 1) if the state is a spacetime fermion (boson). 

What is remarkable is that a rather simple solution of these constraints is 
possible. 

First, let us make a few definitions: 


(1) Letus replace the coefficient elie] with the equivalent expression C(q@ | f). 
where aw (f) is the set of fermions that are periodic around the a (/) cycle. 
(2) Let us introduce a simple “multiplication” rule: 


aB=aVU B—aNB. C1g9:27) 


This means that a’ = a — a = ©, the empty set. Let F equal the set of all 
fermions. Then the product Fa equals the complement of the set a, that 
is, Fa = F -—a. 

(3) Let us introduce 4,, which equals +1 for fermions and —1 for bosons. 

(4) Also n(a) is the number of left-moving fermions 7; (a) minus the number 
of right-moving fermions np(q). 

(5) Letus define e, = e'"""”* and the parity operator (—1)* fora spin structure 
a that obeys 


(-1)*yv =-w(—-1)* if w belongs toa, 
(-l)*¥ = w(-1)” if w does not belong toa. (10.9.28) 
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A spin structure (a | 8) is compatible with supersymmetry if 
(—1)*Tr(—1)* = 8,7, (10.9.29) 


where Tr is the generator of supersymmetric transformations in the 
superconformal group. 

(6) Define Y as a collection of sets with the above supersymmetric prop- 
erty (which forms a group closed under our definition of multiplication 
(10.9.27). Define & to be a subgroup of Y that satisfies 


Cla; |S) = on, (10.9.30) 
miner (C(S |G) = 1. 


Now that we have our definitions, let us state the constraints arising from 
modular invariance. Because n(q@) is a multiple of eight, we have the constraints 
on the basis elements b; of &: 

n(b;) = 0 mod 8, 
n(b; 1b;) = 0 mod 4, (10.9.31) 
n(b; 1) b; ia) b, N b;) = 0 mod 2, 


as well as the constraints emerging from modular invariance: 


+1 ifa,BeS, 
(G = 
(18) | 0 otherwise, 
C(@ | B) = eampC(B |), (10.9.32) 


C(@ | @) = EréyC(a | F), 
C(a | B)C(a | y) = buC(a | By), 


for all a, 8, y within the collection &. 

Lastly, we have the constraints coming from the conformal anomaly cancel- 
lation, which must always be strictly observed. Let us use the fermionic (not 
bosonic) representation of the lattice compactification. This was used for the 
heterotic string in (9.4.4). We must carefully include the fermionic contribution 
to the conformal anomaly in the super-Virasoro algebra. It is rather remark- 
able that a representation of the fermionic partner 7; to the energymomentum 
tensor 7, can be given strictly in terms of the adjoint representation of fermions: 


Tp ~ fey. (10.9.33) 


When we calculate the anomaly arising from this term, we find that the 
fermionic contribution to the anomaly is 4N, where N is the number of 
parameters in the group. ; 

In the left-moving sector, the number of compactified dimensions is 26 — D, 
where D is the number of space-time dimensions. We represent the compact- 
ified sector by fermions in the adjoint representation. In this sector, the three 
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contributions to the anomaly are 


re D 
left movers: b,c ghosts } > { —26 }. (10.9.34) 
v" IN 


Since the sum of the three contributions to the anomaly must be zero, we 
obviously have N = 2(26 — D) fermions in the set y*. 

Now let us analyze the right movers, where the anomalies must also sum to 
zero: 


DG 3D/2 
right movers: bee is Y es Da aie (10:9.35) 


The condition for zero anomaly is therefore N = Bors — D). Including the 
(D — 2)NS-R fermions (in the light cone gauge) contained in y,, we have 
D —2-+3(10 — D) fermions for the right movers (in the light cone gauge). In 
summary, we must have the following number of fermions in order to cancel 
the anomalous term in the Virasoro algebra: 


a movers — 2(26 — D), (10.9.36) 
right movers > D — 2 + 3(10 — D). 


The anomaly cancellation is automatic if we have this many fermions in the 
left- and right-moving sectors. 

Now that we have explicitly expressed all our constraints, our strategy is as 
follows: 


(a) We first calculate the total number of spacetime fermions y and internal 
fermions w“ that satisfy (10.9.36), which removes conformal anomalies 
from the theory. Then we calculate the total number of spin structures that 
arise from this set, which we call F. 

(b) We next randomly choose a collection of spin structures as our initial set 
within & that includes F,, the complete set. 

(c) We then check for closure under multiplication (10.9.27), which generates 
the full group & compatible with this initial choice. (By choosing different 
initial sets for ©, we can reproduce different compactification schemes.) 

(d) We then calculate the square matrix C(@ | B) defined over the set E 
that satisfies our constraints. Some arbitrary phases are introduced in this 
fashion, which allow us to generate more than one solution for each set © 


We will now discuss some of the properties of the solutions to these 
equations. Remarkably, we find that all consistent solutions necessarily have 
gravitons, dilatons, ane antisymmetric tensors. We also find that the presence 
of the massless spin-3 field is sufficient to prove the absence of tachyons and 
the vanishing of the cosmological constant at the single- loop level. These are 
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encouraging results, and they show there is tremendous self-consistency within 
these equations that yield phenomenologically desirable results. 

Now let us discuss a few specific solutions to these equations. We will 
first discuss the 10-dimensional Type II theory (without compactification) in 
the light cone gauge, after we have imposed all the light cone constraints on 
our fermions. Then the fermions consist of the eight-component right-moving 
space-time fermions wr = y!~® and the left-moving fermions a, = W'!~8, 
and the internal fermions are set to zero, yy“ = 0. 


(1) The simplest choice is 
a= {oF}, (10.9.37) 


Unfortunately, a careful analysis of the spectrum of this theory shows that 
it has tachyons and hence is unacceptable. 


(2) The choice 


— 


B= {O, a1, ap, F} (10.9.38) 


has more than one solution, depending on the choice of certain phases 
within the C(q@ | 6) matrix. The two choices reproduce the Type ITA and 
Type IIB theories. 

For the four-dimensional compactified theory, we must choose a different 
set &. In four dimensions, the spacetime fermions are represented by 
the right-moving transverse w!~* and left-moving w!~*. According to 
(10.9.36), the internal degrees of freedom can also be represented by 18 = 
3(10 — D) fermions, in which case we have A'~!® for the right movers 
and A'~'8 for the left movers. We will regroup these 18 internal fermions 


(3 


— 


collectively as (x’, y’, z’), where / = 1,..., 6 and represents six SU(2) 
factors. We then choose the subset a = (w!~’, z'~°). Then the choice 
fo =12,0, Fo, F) (10.9.39) 


produces several N = 4 supersymmetric four-dimensional theories, with 
gauge symmetry SU(2)°, SU(4) @ SU(2), or SU(3) @ SO(S). 


Now, let us analyze the 10-dimensional heterotic string in the light cone 
gauge, where we have the transverse right-moving space-time spinors @ = 
y'-8 and the 32 = 2(26— D) internal left-moving spinors a, = Far = A'~”. 


(1) The choice 
a = {o, F} (10.9.40) 


yields an SO(32) theory with tachyons and no massless fermions, which 
is unphysical. 
(2) Uherehoice 


& = {P, ar, a, F} (10.9.41) 
yields the usual Spin(32)/Z> heterotic string. 


448 10. Calabi-Yau Spaces and Orbifolds 


(3) If we define a; = {A'—!5} and a) = {A'’—**}, then the set 
E = {P, ar, 01, W2, MOR, CHR, A}, 2, F} (10.9.42) 


has two possibilities, depending on the choice of phases. One phase choice 
leads to the standard supersymmetric Eg ® Eg theory, and the other choice 
leads to the (nonsupersymmetric) SO(16) @ SO(16). 

(4) After a four-dimensional compactification, we can again represent the 


right-moving internal fermions as (x/, y’, z') for J = 1-6 and choose 
a = (y'-*, z'-°). Then the simplest choice 
S=jAC afar) (10.9.43) 


yields an N = 4 supersymmetric theory with SO(44) gauge symmetry. 
This is one of the solutions found by Narain using even self-dual Lorentzian 
lattices. 


Clearly, there are probably thousands of different sets of = that close 
under the multiplication rule (10.9.27) and satisfy the modular constraints. 
This formalism gives us a handle on the problem of constructing a complete 
classification of all possible compactified solutions compatible with modular 
invariance. 

Before concluding this chapter, it is worth pointing out other directions that 
compactification may take such as “asymmetric orbifolds,” which are larger 
than the set of orbifolds discussed so far. Asymmetric orbifolds [31] may also 
give us a handle on classifying the thousands of modular invariant solutions 
to the string equations. Asymmetric orbifolds are string theories in which the 
left- and right-moving degrees of freedom live on different orbifolds. When 
compactifying |0-dimensional space-time, we have a natural bias toward com- 
pactifying symmetrically in the left- and right-moving sectors, but there are 
consistent modular-invariant asymmetric compactifications in which the six 
space dimensions are treated asymmetrically. 

Although it is difficult to visualize how to twist on asymmetric orbifolds, it 
can be shown rather simply that modular-invariant solutions exist. For example, 
let us mod out by a factor of g, where g belongs toa discrete group. The partition 
function for the single-loop diagram must now trace in the t direction in the 
presence of this g factor: 


Tr gqttg he, (10.9.44) 


The calculation of the trace can be done exactly as before, except now the spaces 
are different. As before, we find that the condition for modular invariance is 
level matching: 


n(E;, — Er) =0mod 1. (10.9.45) 


If the eigenvalues of g are represented by e*™'" ", this condition gives us re- 
strictions on the sum 5~, r7. The resulting constraints on the r; are almost 
identical to the ones found for the symmetric orbifold (10.9.9), except the left- 
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and right-moving sectors are now treated differently, depending on the struc- 
ture of space. Although asymmetric orbifolds are more difficult to work with 
than symmetric orbifolds, a large class of asymmetric orbifolds can actually 
be constructed that are compatible with modular invariance. (In practice, the 
asymmetric orbifold is sometimes embedded in a larger symmetric orbifold, 
where the calculations are simpler, and then truncated at the end to retrieve the 
asymmetric orbifold.) 

In summary, the advantage of asymmetric orbifolds is that they allow us 
to classify an enormous class of modular invariant compactifications in four 
dimensions. This will be of great help toward understanding the complete set 
of physically relevant superstring compactifications. 


10.10 Summary 


We have seen that a few physically reasonable assumptions about the com- 
pactification process have led to a wealth of phenomenological predictions. 
Although no model has yet been proposed that can successfully predict all 
the known properties of the low-energy particle spectrum, we are encour- 
aged by the qualitative results we have obtained. Let us now specifically 
isolate the logical sequence by which given assumptions led to certain 
conclusions. 

Our natural choice is to compactify the 10-dimensional space on the 
six-torus. However, compactifying on this surface transforms N = | 
supersymmetry in 10 dimensions into N = 4 supersymmetry in four di- 
mensions. Thus, we must search for new assumptions that will be more 
phenomenologically acceptable. 

We began with the following assumptions: 


(1) The 10-dimensional universe has compactified to 
Mi > My x Ke, (10.10.1) 


where M, is a maximally symmetric space, 


R 
Ryvap = 75 (Bua8vp — 8up8va)s (10.10.2) 


and K is compact. 
(2) An N = 1 local supersymmetry remains unbroken after the compactifica- 


tion. 
(3) Some of the bosonic fields can be set to zero: 


p= doiese (10.10.3) 


The assumption that N = | supersymmetry survives the compactification, 
is a stringent assumption. It implies that the variation of the spinors must be 
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Zero 
| 
OWA = gee = 0, 
8x° =P Fite =0. (10.10.4) 


The existence of a covariantly constant spinor ¢ places an enormous number 
of constraints on the manifold. In particular, if we differentiate the equation 
once again 


Dye =0—-> Pee D; je = 0, 
=> Rijul“e =) (10.10.5) 
Thus, we can show that the space is Ricci flat. Second, we can show that 
the space has SU(3) holonomy. We define the holonomy group as the group 
generated when we take a spinor around successive closed paths around a point. 


Normally, in six dimensions the holonomy group is SO(6), or equivalently 
SU(4). But if € is covariantly constant, it means that 


é— Ue. (10.10.6) 
By an SU(4) transformation, any spinor can be brought into the form 
0 
0 
E> Eeé= vole (10.10.7) 
€0 


This, in turn, means that the U matrix rotates only the three zero indices in 
the column matrix. Thus, U must be a member of SU(3), and so the space has 
SU(3) holonomy. 

In general, spaces with SU(3) holonomy are notoriously difficult to deal 
with. However, we can show that the space is Kahler and so use a powerful 
theorem by Calabi—Yau. We can always define the tensor 


J 19 Ele. (10.10.8) 


which satisfies J/> = —1, just like i> = —1 for the usual definition of complex 
numbers, and so we can usc this tensor to define a complex manifold. However, 
we know that this tensor must be covariantly constant (because ¢ is covariantly 
constant), and thus the space is Kahler. 

Fortunately, Kahler manifolds that are Ricei flat (and hence have vanishing 
first Chern class) can be shown to be equivalent to spaces of SU(3) holonomy 
and are easy to construct. Thus, we have the logical sequence 


D,e = 0 — Kis Ricci flat, Kahler, with vanishing first Chern class. 
(10.10.9) 
The problem, however, is that now we are flooded with potentially thousands 
of manifolds preserving N = | supersymmetry! 
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So far, we have not broken the Es ® Eg gauge symmetry. Our next step is 
to notice that the Bianchi identities, which used to be empty identities, now 
become nontrivial with our assumptions: 


TOR AR WE Ar. (10.10.10) 


This equation is surprisingly difficult to satisfy. On the left, we have the curva- 
ture forms over the Riemannian space, and on the right we have the Yang—Mills 
curvature forms. In order to solve this rather strange-looking equation, we will 
embed the spin connection into the gauge field, thereby mixing the two man- 
ifolds and breaking the original gauge symmetry. Because we have SU(3) 
holonomy in the Riemann sector, we will have the following breaking in the 
gauge sector when we mix the two sectors: 


Bianchi identity — spin embedding > SU(3) @ E, @ Eg. —_—(10.10.11) 


Fortunately, this gives rise to desirable results. We know that Eg by itself does 
not have chiral representations and hence is not an acceptable candidate for 
model building, but Es does. In fact, the decomposition under this breaking of 
the 248 elements of the adjoint of Eg becomes 


248 = (3, 27) @ GB, 27) @ (8, 1) @ (1, 78). (10.10.12) 


This is gratifying, because the 27 is the most suitable representation for the 
quarks and leptons in GUTs with E,. Thus, we have 


TrR A R= iTrF A F — spin embedding > 27 fermions. (10.10.13) 


The next question is: How many generations of the 27 do we have? Nor- 
mally, the generation number has nothing to do with the gauge group. In GUTs, 
they are entirely distinct. We can have an arbitrary number of exact copies of 
fermion generations in GUTs. However, in string theory the process of spin 
embedding produces a constraint on the generation number. Because of spin 
embedding, the gauge fermions are now linked to the K, manifold. The gen- 
eration number can be viewed as a topological number, because the difference 
between positive and negative chiral zero eigenvalue solutions of the Dirac 
equation is a topological number. Specifically, 


generation number = $1x(M)I. (10.10.14) 


Unfortunately, the Euler number of Ricci-flat Kahler manifolds is usually quite 
high, but we can always reduce this number drastically by taking a submani- 
fold. For example, we can divide by a discrete group that preserves a certain 
polynomial in the coordinates. A good example is C P; divided by Zs x Zs, 
which is not simply connected. This manifold has four generations. 

Next, we want to break the model even further to the Standard Model SU(3)® 
SU(2)@U(1) without destroying N = | supersymmetry. Normally, this is quite 
difficult. Breaking schemes that break the group down to the standard model 
also break N = | supersymmetry. One solution to this problem is to use the 
method of Wilson lines. We saw earlier that the manifolds K, that we are 
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considering are not simply connected in order to produce a low-generation 
number. For nonsimply connected manifolds, the Wilson loop 


U= Pexp (/ Aydx") (10.10.15) 
re 


does not necessarily equal | even if the curvature tensor vanishes. This is 
because we cannot always shrink closed looped into points. Thus, we can 
break Es symmetry down to a subgroup G by choosing an element of U so 
that G commutes with all elements of this element. For example, if we take an 
element of U that is Z; x Zs, we find 


i. SU(3)c ® SU(2), ® SU(2)p ® UL) @ U1). (10.10.16) 
Zs x Z5 


Unfortunately, we see that, in addition to arriving at the standard model, we 
also have unwanted U(1) groups. Thus, 


Wilson lines ~ SU(3) @ SU(2) ® U(1)". 


Orbifolds are another way of compactifying the superstring, which is prob- 
ably the limiting case of a Calabi—-Yau space. Orbifolds are created by taking 
a torus 7, and dividing by a discrete group Z,,, which allows fixed points 


orbifold: tS (10.10.17) 


(These fixed points apparently do not spoil the properties of the string model.) 
Nontrivial constraints are placed on orbifolds by modular invariance, which 
mixes the boundary condition nontrivially. The boundary conditions 


X(o; + 270. 02) — hx(o;, 0>), 
X(o1, 0) + 22) = gX(o1, -), (10.10.18) 


(where A and g are elements of Z,,) can break the symmetries of the string 
model. The subgroup of the space-time group and the internal group that sur- 
vives the process is the subgroup that commutes with g and /1. If we diagonalize 
g and / to equal the elements e*”'" , where v; = r;/n for some integer n, then 
the zero-point energy of the Hamiltonian is shifted by an amount proportional 
to u;(v; — 1). Because the trace over the single loop is sensitive to the zero- 
point energy, this means that modular invariance places a restriction on the 
eigenvalues: 


8 8 8 
Dor = Dorit op (10.10.19) 


where r; are the eigenvalues from the right-moving sector, and the two sets 
of eigenvalues r;>,; are from the left-moving Eg ® Eg sector. If we include 
Wilson lines, then the group that survives the compactification is the subgroup 
that commutes with g, A, and the Wilson lines. 
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Phenomenologically, there are many solutions to this compactification, some 
with three generations and the group SU(3) @ SU(2) ® U(1)", which yields too 
many U(1) factors. Unfortunately, the method of Wilson lines does not change 
the rank of the group, so in general we will have too many U(1) factors. 

The great advantage of orbifolds over Calabi—Yau spaces is that they are sim- 
ple, flat, and thousands of them can be explicitly constructed. Unfortunately, 
both suffer from the problem that there is no systematic way to catalogue these 
thousands of solutions. 

One step in this direction is to use modular invariance and the absence of 
tachyons and anomalies to systematically derive all possible solutions. We 
begin with the amplitude written as a sum over spin structures: 


a=de[ 3 |4l |: (10.10.20) 


Now we demand that the C’s factorize properly, have modular invariance, and 
yield models that have no tachyons or anomalies. Remarkably, it is possible 
to solve the constraints on C and obtain solutions. The simple ones reproduce 
some of the known compactifications, but thousands of other compactifications 
are still being investigated. This yields the hope that we may be able to exhaust 
all possibilities and obtain a realistic model. 

Another way in which to generate large classes of four-dimensional solu- 
tions is through asymmetric orbifolds. The heterotic string is an example of an 
asymmetric compactification, i.e., treating left- and right-moving sectors dif- 
ferently. Asymmetric orbifolds are the largest class of orbifolds found so far 
and may overlap significantly with the work of others using different types of 
compactifications. (In addition to asymmetric orbifolds, we should also men- 
tion the possibility of non-abelian orbifolds, i.e., orbifolds based on dividing 
out by non-abelian finite groups such as the crystal groups. The advantage of 
such constructions is that they help us eliminate some of the unwanted U(1) 
factors and also yield three and four generations. See [40].) 
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M-Theory 
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CHAPTER 11 


M-Theory and Duality 


11.1 Introduction 


Up to now, none of the methods discussed in this book have provided us with 
deep insights into nonperturbative string theory and its true vacuum. There 
are still gaps in our understanding of the string perturbation theory because of 
the problems associated with supermoduli. Furthermore, string perturbation 
theory cannot yield the true vacuum of string theory. Conformal field theories, 
although they give us a powerful tool to classify millions of possible vacua 
for string theory, are still defined perturbatively and are. unable to probe the 
nonperturbative region. String field theory, although it is defined independently 
of perturbation theory, is currently too difficult to solve for the nonperturbative 
region. 

One of the most exciting new approaches to nonperturbative string theory 
involves M-theory and duality [1-6], which in fact force us to reconsider the 
central role played by strings in supersymmetry. Not only have they given us 
surprising results concerning the nonperturbative behavior of string theory, 
they have also forced us to reconsider previously discarded theories, including 
supergravity and membranes of various dimensions (called “‘p-branes”) which 
are now seen in an entirely new light. In particular, duality allows us to show that 
the five different self-consistent superstring theories are nothing but different 
solutions of a single theory, called “M-theory.” In this revised picture, the 
various string theories are nothing but different vacua of a single theory. While 
perturbation theory only probes the vicinity of each vacua, duality allows us 
to make stunning nonperturbative correlations across different vacua. 

Although M-theory and duality have yet to give us definitive information 
about four-dimensional vacua with N = | supersymmetry, they have already 
clarified much of the nonperturbative nature of string theory in 10, 8, and even 
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6 dimensions, giving us a complex web of dualities between different string 
compactifications. 

Furthermore, M-theory indicates that the “true home” of the theory may 
actually be the eleventh dimension, where we find new, exotic objects, such as 
super membranes and 5-branes. 

Just a few of the surprises and insights of M-theory are as follows: 


i 


All five string theories, which on first glance have entirely different 
properties and spectra, are now seen as different vacua of the same theory. 
The strong coupling limit of Type IIA and Ex @ Ex heterotic string theory 
is revealed to be an 1|1-dimensional theory, whose low-energy action is 
given by 11-dimensional supergravity. Compactification of M-theory ona 
circle (line segment) yields Type IIA (heterotic) string theory. 

The complete action of M-theory is unknown, but is believed to contain 
membranes (2-branes) and 5-branes. Closed strings in lower dimensions 
can be viewed as compactifications of these membranes. 

A vast network of perturbative and nonperturbative dualities (called S-T—-U 
dualities) has been established between different string theories in various 
dimensions, for the first time giving us insights into the nonperturbative 
behavior of string theory. 

Exact statements about the spectra and properties of string theory, inde- 
pendent of perturbation theory, can be made by analyzing the properties of 
“BPS saturated states,” which obey nonrenormalization theorems. These 
BPS states can be enumerated from group theory alone by analyzing the 
central charges in the supertranslation algebra. 

The nonperturbative regime of string theory yields entirely new types of 
membranes, called “D-branes,” which play an essential role in the theory. 
Open strings with Dirichlet boundary conditions can end on these D- 
branes. Surprisingly, these D-branes allow us to use ordinary conformal 
field theory techniques to calculate highly nonlinear interactions. 

By counting D-brane states. we can derive the Bekenstein—Hawking 
entropy formula for black holes using statistical mechanical arguments. 

1 1-dimensional M-theory, in the infinite momentum frame, is equivalent 
toa 10-dimensional theory of point-particle Dirichlet 0-branes in the N — 
oo limit, which is called M(atrix) theory. Remarkably. this matrix model is 
based ona simple U(N) quantum mechanical Yang—Mills theory, to lowest 
order, with a finite number of degrees of freedom. 


.2 Duality in Physics 


Duality is actually an old phenomenon. Maxwell's equations. for example, are 
invariant under the following duality transformation: 


E — B, 
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ees Oe (12.1) 


if we interchange 


ep, Clie2.2) 


where g represents the charge of a monopole. Dirac’s theory of monopoles 
shows us that 


eo= nn, Ce? 3) 


where n represents an integer. This leads us to a curious conclusion, that 
Maxwell’s theory of electric charges defined in the strong coupling region 
(where e becomes large) is equivalent to a weakly coupled theory of monopoles 
(where g becomes weak) and vice versa. The strong coupling region of electro- 
dynamics, which is normally prohibitively difficult to analyze, is now revealed 
to be an ordinary theory of monopoles at weak coupling. Although this result 
was long thought to be a curiosity, it represents one of the few instances in 
field theory where the nonperturbative region can, in principle, be analyzed 
and shown to be deceptively simple. 

Attempts to generalize this to Yang—Mills theory and to supersymmetry, 
however, have been mixed. There exists a classical formula from the theory 
of dyons (classical solutions of gauge theory with both electric and magnetic 
charges) that states 


M? = (o)*(e? + g’), | (11.2.4) 


where M is the mass, (¢) is a constant, and e and g represent the electric and 
magnetic charges. Notice that this gauge theory solution obeys the relationship 
e€ <> g, So it was conjectured that this may, in fact, be an exact symmetry of 
the theory [7-8]. However, attempts to prove this kind of relationship at the 
quantum level have floundered over the years. 

Certain tantalizing clues were noticed over the years which pointed to the 
importance of duality. It has long been suspected that the so-called BPS rela- 
tions, which generalize the above constraint, are unrenormalized by quantum 
effects due to supersymmetry and hence may hold exactly in the complete 
theory. This led some to suspect that perhaps duality may be preserved non- 
perturbatively in a certain class of supersymmetric Yang-Mills theories, such 
as N = 4 super-Yang—Mills theory. 

Similarly, it was noticed years ago that the D = 11 supergravity equations of 
motion, when toroidally compactified, obeyed peculiar “hidden” symmetries 
and duality relations. But the origin of these duality relations were totally 
obscure, so they were thought to be a curiosity, rather than a fundamental 
observation. 

Only recently have all these speculations concerning duality and nonpertur- 
bative field theory been pulled together to give a comprehensive picture, giving 
us unprecedented insight into the nonperturbative structure of string theory. 
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Duality is emerging as the key to understanding one of the deep mysteries of 
string theory, such as why there are five of them. It’s puzzling that there should 
be five perfectly finite, self-consistent field theories which combine gravity 
with other quantum forces. 

For example, in the Green-Schwarz formalism, notice that there are five 
ways in which we can construct a supersymmetric generalization of the 
Nambu-Goto action. For a supersymmetric theory, we are allowed to intro- 
duce a 32 component Majorana spinor, which can be decomposed into two 
16-component Majorana—Wey] spinors @'-’. (Introducing more spinor com- 
ponents will generate massless spin 3 fields, which are probably inconsistent 
quantum mechanically, and also isomultiplets of gravity, which would violate 
the equivalence principle.) If 6. isa Grassmann field, of either positive or nega- 
tive chirality with 16 components each, then we can construct a supersymmetric 
generalization of 0; X": 


Ie = a,X" — 16, 19,0, @41.3.1) 
which is invariant under the global supersymmetric transformation 
10 «= wae 
OX = aero: am 


If we have only one chiral fermion 6, on the world sheet, then we have 
N = 1 supersymmetry (which describes the heterotic string). If we have 4! 
and 62 of opposite chiralities, we can combine them into single 32 component 
Majorana spinor 6 with N = 2 supersymmetry. This will give us Type IIA 
string theory. 

If we have 6} and 6; of the same chirality, then this gives us chiral Type IIB 
string theory with N = 2 supersymmetry. We then find three ways in which 
to construct a global invariant IT}: 


8; X" — i6,143;0, heterotic, 
Mt = % a,X" —id“a,6 IIA, (11.3.3) 
aX" —i5;,0/0"3,0% TB. 


Then the first part of the superstring action 5; can be written as the 
generalization of the Nambu—Goto action: 


gi = TH; Nyy, 
S, = -1 f a’o/=aet Rij- (11.3.4) 


We obtain five superstring theories in all, since there are two types of het- 
erotic strings, and the Type I string is derived from the Type IIB theory by taking 
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its world sheet parity even sector (and adding open strings and Chan—Paton 
factors to cancel anomalies). 

Although this Nambu—Goto term is explicitly invariant under a global super- 
symmetry transformation, it is not locally invariant, and hence has the wrong 
number of degrees of freedom. To remedy this, we must add a Wess—Zumino 
term. We define an invariant form hwz: 


11" d6,T, dO, heterotic, 
hwz = 4 U1" d6r,0), d6 IIA, (11.3.5) 
SU" doi, dol IIB. 


where I"); is the product of all T matrices and S;; is a two-by-two matrix which 
equals the Pauli matrix o-. hwz can be expressed as hwz = db. If we carefully 
extract b from hwz, then the Wess—Zumino-like part of the action is given by 


T ‘fe 
= 5 [Poe (11.3.6) 


Then the string action equals S$; + 5). 

When we include both types of heterotic strings (with Eg ® Eg and SO(32) 
symmetry) and also the open and closed string sector of Type I strings, then 
we obtain all five superstring actions. (Usually, when we refer to superstring 
theory, we are actually referring to the Ex ® Ex heterotic string.) 

The differences and similarities of these theories can be seen by analyzing 
the zero-mass sector of these theories, which reduces to supergravity in 10 
dimensions. The five superstring theories, and their supergravity reductions, 
are given by: 


e Type IIA string theory reduces to N = 2A (nonchiral) supergravity. 

Type IIB string theory reduces to N = 2B (chiral) supergravity. 

e E; @ Eg heterotic string theory reduces to N = | supergravity coupled to 
an Eg ® E, Yang—Mills multiplet. 

e SO(32) heterotic string theory reduces to N = | supergravity coupled to 
an SO(32) Yang—Mills multiplet. 

e Type | string theory, which contains both open and closed strings, reduces 
to N = | supergravity coupled to an SO(32) Yang—Mills multiplet. 


Notice that each of the five superstring theories are significantly different 
from the others. For example, Type I theory is based on both closed and open 
strings, while the others are based on only closed strings. Similarly, the gauge 
structure of the five string theories, except for the last two, are markedly dif- 
ferent in their low-energy structure. The mystery of why there should be five 
string theories, each of which apparently unifies gravity with the other quantum 
forces, is solved when we begin to analyze various dualities (see Fig. 11.1). 
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FIGURE 11.1. The web of dualities in M-theory. By compactifying M-theory down 
to 10 dimensions, we can derive Type IIA strings and the heterotic strings. By 
compactifying down to nine dimensions, all five string theories can be unified. 


11.4. T-Duality 


To understand how unusual 7 -duality is, recall the fact that, in ordinary point 
particle field theory, compactification of one dimension leads to a periodic 
momentum 
n 

a (11.4.1) 
for a circle of radius R and an integer n. If we take the limit R — 2%, we 
find that the momentum becomes continuous, and hence we retrieve the full 
uncompactified theory. But now take the limit R — 0. Here we find that the 
momentum becomes either 0 or oo, and hence the compactified dimension 
effectively decouples from the theory. We see, therefore, that the two limits 
are entirely different in point particle field theory. 

Now consider compactifying, say, the ninth dimension, of the closed string 
theory. Now an additional complication appears which did not exist for the 
point particle theory: the string can wind around the compactified dimen- 
sion. We know that the momentum operators take the following values 1n this 
compactified theory: 


n n 
(P,, Pr) = | — +mR, ——mR}, i422 
2R 2R ( ) 
where the n, as usual, arises from the Kaluza Klein excitations of the circle, 
but mm labels the number of times the string winds around the circle. 
Notice that the mass spectrum for M° is invariant under [9]: 


Rw 4. 
arr (11.4.3) 


when we interchange n <> m. 
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This is a highly unusual symmetry, one which links the large-scale behavior 
of string theory to its small-scale structure. Unlike point particle field theory, the 
string cannot differentiate between these two regions. Notice that this duality 
symmetry, which interchanges winding modes with Kaluza—Klein modes, is 
strictly a result of the geometry of string theory and does not appear in point 
particle theories. 

If we rewrite this duality transformation in the familiar language of 
conformal field theory, this transformation is equivalent to making the 
substitution 


aX —> dX, 
aX > —AXx. (11.4.4) 


Notice the sign change for one set of movers. When we apply this same du- 
ality to superstring theory in the Neveu-Schwarz—Ramond (NS-R) formalisin, 
we similarly find 


vi > 
Wn > we. (11.4.5) 


But this transformation of the ninth left-moving oscillator also reverses the 
sign of the 10-dimensional left-moving chirality operator which is constructed 
from the fermionic zero modes: 


Pn =wevivi.. vis Tu. (11.4.6) 


so it flips the chirality of the left-movers. The Type IIA spinors, defined in terms 
of positive and negative chiralities, are transformed by T -duality into a theory 
with the same type of chirality, i.e., the Type IIB theory. In nine dimensions, 
we find the following duality emerging [10]: 


T: WA < IIB. (11.4.7) 


(We will use the symbol < to represent symbolically the duality relationship.) 

We can therefore view Type IIA and Type IIB as merely two extreme points 
along the same continuum of vacua labeled by R. As R — oo or R — 0, we 
retrieve the usual Type II string theories in 10 dimensions. 

Thus, the five superstring theories have now been reduced down to four. 

Yet another 7 -duality can be found by analyzing the Es @ Eg and the SO(32) 
heterotic string. Consider first the Narain lattice. Let us compactify the het- 
erotic string down to d dimensions. This means we must compactify 26 — d 
left-moving dimensions and 10 — d right-moving modes. Let us compactify 
these dimensions onto a lattice [', and Ir. We recall that by constructing the 
single-loop heterotic string amplitude and demanding that it satisfy modular 
invariance, we have the additional constraint: 


ped lp lp ad, (11.4.8) 


where Z refers to the integers. 
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The fact that there is a minus sign in the metric means that this is a Lorentzian 
lattice. Modular invariance also demands that these lattices be even, self-dual, 
and Lorentzian, but that still leaves considerable freedom in choosing the lat- 
tices. In fact, we can always preserve the previous constraint by rotating the 
lattice by SO(26 — d, 10 — d) and still satisfy modular invariance, with each 
configuration representing a different vacuum [11]. 

The group SO(26—d, 10—d), however, is still too large to typify the moduli 
space of possible string vacua. There are still redundancies. The mass operator 
is still invariant under separate SO(26 — d) ® SO(10 — d) rotations acting 
separately on the lattice of the left- and right-moving modes. 

Finally, we must divide out by the action of 7-duality to find the moduli 
space of inequivalent vacua. The discrete group 


P= S006 a0 ae (11.4.9) 


merely reshuffles the lattice points and hence does not change any of the 
physics. This is the generalization of R —> +R and simply maps one vacua 
into other equivalent vacua. : 

We find, in summary, that the moduli space which describes the space of 
distinct, inequivalent vacua of the heterotic string is therefore 


S0(26 — d, 10 — d) 
S0(26 — d) @ SO(10 —d) @T 


where the number of independent parameters in M,,,,, is mn. 
(In the literature, M¢_4 19a is often written as 


SO(26 — d, 10 — d, Z)\SO(26 — d, 10 — d)/SO(26 — d) @ SO(10 — a), 
(11.4.11) 
but for clarity, we shall use the notation which simply divides the naive moduli 
space by all the redundant factors.) 

This symmetry must also be reflected in the space of scalar fields of the 
effective supergravity theory, since the vacuum expectation values of these 
scalar fields labels the possible vacua. For example, in the heterotic theory, if we 
compactify c dimensions, then the scalar particles have the following number of 
components given by the reduced internal metric g,,, internal antisymmetric 
two-form 8;;, and internal Cartan gauge fields A’, where i represents the 
compactified coordinates 


moduli space = Mo6_4 10-4 = (11.4.10) 


l 
‘iS ee Ly; 
8ij xelc Ta 
ee mG aii (11.4.12) 
A; : cn, 
fora total of c? +cn scalar particles, where n = 16 for the heterotic string. This 


is precisely the dimension of the coset moduli space SO(c +1, c)/SO(c +n) ® 
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SO(c). So the scalar fields parametrize the moduli space of possible vacua, as 
expected. 

The important lesson is that the vacua transformed by the T-duality group 
SO(26 — d, 10 — d, Z) contains both the Ex ® Eg heterotic string as well as 
the SO(32) heterotic string as extremal points in the same space. Symbolically, 
we find [12], 


T: E,@ Es; S032). - (is) 


Thus, we now have gone from four superstrings down to three. 

We can perform a similar analysis of the lattice compactification of the 
Type II string, where we compactify simultaneously in both the left- and right- 
moving sectors. Not surprisingly, we find that the T-duality group is again 
given by T = SO(10 — d, 10 — d, Z). However, the full moduli space of 
inequivalent vacua for Type IIA and Type IIB theories is considerably more 
complicated than simply a combination of the orthogonal groups because of 
the presence of extra scalars, as we will see in the next chapter. 


11.5 S-Duality 


So far, we have only analyzed T-duality, which is perturbative in the string 
coupling constant and hence sheds no light on the nonperturbative nature of 
these theories. More interesting is S-duality, which links the weak coupling 
region of one theory to the strong coupling region of another. Since S-duality 
is inherently nonperturbative, we expect completely new surprises to emerge 
[13—16]. 

We will find that the dilaton field is central to this discussion of nonper- 
turbative effects because its expectation value is related to the string coupling 
constant. The first quantized string Lagrangian contains not only the usual 
term describing the area of the world sheet, but also a coupling to the Riemann 
curvature of the world sheet 


JB (g4?dgX" 9, Xp + PR”). (11.5.1) 
We use the fact that the Euler number for a two-dimensional Riemann surface 


of genus g is given by 


x= 5 | Pov=er®. (iile5:2) 
IT 


Therefore if we make the substitution ¢ > ¢ + (@), the Euclidean path- 
integral e~* gains a new term 


et, -Renae, (11553) 


In general, each n-point string amplitude is multiplied by the coupling constant 
factor g?8-°*", where n is the number of boundaries or external strings. If we 
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place e'*) at each vertex function of the Riemann surface, we can reabsorb this 
factor into the string coupling constant g, by redefining it as 


2, =e? (11.5.4) 


Thus, the coupling constant is related to the vacuum expectation value of the 
dilaton field. 

The key point is that S-duality, which changes (¢) into —(@), therefore 
connects the strong coupling with the weak coupling region. Let us see how 
this works in several examples. 


11.5.1 Type IIA Theory 


S-duality reveals a profound, counterintuitive result when applied to Type ITA 
strings, where we find a mysterious | 1-dimensional structure emerging. 

We begin our discussion by considering the original |1-dimensional su- 
pergravity theory [17], which was historically dismissed because it was 
nonrenormalizable and could not accommodate chirality. We will now rean- 
alyze this theory from the perspective of duality, adding in new terms which 
will cure both of the original problems. 

The bosonic version of this theory contains the 1 1-dimensional metric 247 
as well as the third-rank, antisymmetric tensor Ay yp: 


1 
aes dx {/—g[R+ SIFI’] 


1] 


MN Fey ty Fs... AMoMioM) , (11.5.5) 

(72) 
where Fy, ... uv, is the field tensor constructed out of antisymmetrized derivatives 
of Am, M>M3° 

Now compare this with the effective Type IIA action in terms of massless 
fields, once we integrate out over higher fields. We couple the string variables to 
the massless fields of the theory and treat them as background fields; in addition 
to the graviton g,,,,and dilaton @, we have an antisymmetric, second-rank tensor 
B,,, coming from the product of two Neveu-Schwarz fields. (We recall that the 
bosonic sector of the closed string in the Neveu Schwarz—Ramond formalism 
comes from the product of two NS operators or two R operators, coming from 
the left and right movers, i.c., the bosonic spectrum is spanned by the states 
NS; ® NSp or R; @ Re. R-R states exist for Types IA, IIB, and I strings, but 
there are no R—R states for the heterotic string.) 

In fact, both Type IIA and Type IIB have the same massless fields in this 
NS-—NS sector 


NS-NS: {?, Bis Baal : (11.5.6) 
But the R-R sector for the Type IIA theory contains the additional fields 
R-R:  {Cy, Ayvp} - (ise 
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Now let us write the bosonic part of the effective action of Type ILA theory 
(to lowest order) in terms of massless fields, which is just 10-dimensional 
nonchiral supergravity with N = 2 supersymmetry. If we let K = dC, H = 
dB, and G = dA, then the action for the massless fields (to lowest order) is 


S= f ax {J=ge* [r+ slag? — 1H] 


— J/—g [IK + ZIG?]}+ a i GAGAB. (11.5.8) 

Years ago, it was noticed that if we compactify the 11-dimensional su- 
pergravity theory on a circle, and let the radius of the circle go to zero, we 
obtain 10-dimensional Type IIA supergravity theory. However, the true phys- 
ical meaning of this correspondence was not understood at that time. Now we 
wish to analyze this correspondence in much greater detail. 

Notice that the actions for the 11-dimensional supergravity and the Type 
IJA string theory (to lowest order) are identical if we decompose the 11- 
dimensional metric tensor gyy and the three-form Aywp into the following 
10-dimensional fields: 


gun > (Suv, Cu), 
Amnp = (Ajo: Bi) ’ (11.5.9) 


where the radius of the eleventh dimension is given by e7¢/?: 
More precisely, the 11-dimensional metric tensor and antisymmetric field 
can be decomposed as 


ds’ = 2MN dx! dx = Cas, dx dy” 
+ ¢/? (dy — dx"C,) ; 
A = gdx" A dx” A dx? Ayyp + 3x" A dx” A dyByy. (11.510) 


At first, we may criticize this result because |1-dimensional supergrav- 
ity compactified on a circle has Kaluza—Klein states which are not seen in 
the 10-dimensional Type IIA string theory. However, as we will see later, 
these Kaluza—Klein states actually do exist in 10-dimensions, disguised as 
solitonlike solutions. When we compare the Kaluza—Klein states coming from 
1 1-dimensions with the solitonlike states arising in 10-dimensions, we will see 
that they are identical. In other words, the 10-dimensional Type IIA (with its 
solitonlike states) is actually an 11-dimensional theory in disguise! 

Similarly, Type IIA theory expressed in terms of massless fields is highly 
nonpolynomial in the curvature tensors, while | 1-dimensional supergrav- 
ity is not. Therefore, we expect that Type IIA string theory, rewritten as 
a 11-dimensional theory, includes 11-dimensional supergravity only as its 
low-energy limit. 

We are then led to postulate the existence of an entirely new | 1-dimensional 
theory, called M-theory. Our conclusion is then as follows: there exists anew 
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11-dimensional theory, called M-theory, containing 11-dimensional super- 
gravity as its low-energy limit, which reduces to Type IIA string theory (with 
Kaluza—Klein modes) when compactified on a circle. Specifically, the strong 
coupling limit of Type IIA is M-theory. 

This is truly aremarkable result [18-19]. It indicates that the strong coupling 
limit of 10-dimensional Type IIA superstring theory is equivalent to the weak 
coupling limit of a new | 1-dimensional theory, whose low-energy limit is given 
by 11-dimensional supergravity. More precisely, we have 


Ries (iS) 


From this, we see why previous efforts failed to notice this deep correspon- 
dence. Using perturbation theory around weak coupling in 10-dimensional 
Type IIA superstring theory, we would never see |1-dimensional physics, 
which belongs to the strong coupling region of the theory. As (@) becomes 
large, we enter the strong coupling region of Type IIA string theory, and 
R,, — 00, so the 11-dimensional nature of M-theory becomes apparent. 

We also see indications that M-theory is much richer in its structure than 
string theory. In M-theory, there is a three-form field A yy, which can couple 
to an extended object. We recall that in electrodynamics, a point particle acts as 
the source of a vector field A,,. In string theory, the string acts as the source for 
a tensor field B,,,,. Likewise, in M-theory, a membrane is the source for A 47, p. 
Although strings cannot exist in 1 1-dimensions, we see that other superobjects 
can, such as supergravity, supermembranes, and its dual. the super 5-brane. 
(We will explicitly construct the actions for these super p-branes in a later 
chapter.) 

All these astonishing results, however, raise more questions than answers. 
For example, although M-theory explains the strong coupling behavior of Type 
IIA superstrings, it does not tell us what the full nonpolynomial action of the 
1 1-dimensional theory is. We only know its low-energy part, which is 11- 
dimensional supergravity. At present, there is no known way to systematically 
derive the entire nonpolynomial theory. Also, we know very little about the 
quantization of these membranes and S-branes in |1 dimensions, and hence 
cannot make definitive statements about their interactions. 

Ironically, 11-dimensional supergravity was previously rejected as a 
physical theory because: 


(a) it was probably nonrenormalizable (i.c., there exists a counterterm at the 
seventh loop level); 

(b) it does not possess chiral fields when compactified on manifolds; and 

(c) it could not reproduce the Standard Model, because it could only yield 
SO(8) when compactified down to four dimensions. 


Now we can view |]-dimensional supergravity in an entirely new light, as 
the low-energy sector of a new | 1-dimensional theory, called M-theory, which 
suffers from none of these three problems. The question of renormalizability 
is answered because the full M-theory apparently has higher terms in the cur- 
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vature tensor which render the theory finite. The question of chirality is solved 
because although 11-dimensional supergravity cannot give us chirality when 
compactified on a manifold, M-theory gives us chirality when we compactify 
on a space which is not a manifold (such as line segments). And the problem 
that SO(8) is too small to accommodate the Standard Model is solved when 
we analyze the theory nonperturbatively, where we find Eg @ Eg symmetry 
emerging when we compactify on line segments. 

We can also see why physicists missed this rather simple correspondence 
for many years. Because the coupling constant grows with the radius of com- 
pactification, perturbation theory will never reveal this eleventh dimension to 
any finite order in the expansion. This correspondence between 11- and 10- 
dimensional physics is inherently a nonperturbative one. We represent this 
S-duality relationship symbolically as 


S: MonS, = IIA. (11.5.12) 


11.5.2 Type IIB Theory 


The S-duality between Type IIA and M-theory is so remarkable that it must be 
checked in as many ways as possible. Because of the web of S- and T -dualities, 
there are a number of self-consistency checks we can make. 

Since Type IIB strings are linked to Type IIA strings by T-duality in 
nine dimensions, there should be a nontrivial link between 1 1-dimensional 
supergravity and Type IIB strings (to lowest order). 

In addition to the fields coming from the NS-NS sector, there are also Type 
IIB fields coming from the R—R sector, which consist of another antisymmet- 
ric, second-rank tensor B’ , in addition to a scalar field /, and a fourth rank, 


: eau > 
antisymmetric tensor Cyyop: 


ReR: {1B i. Caos (ils:13) 


pv? 
The effective action for Type IIB theory, when all higher modes have been 
integrated out, is (to lowest order) 


S= [atx {e-**TR + 4|do|? — 4|H 7] — 2\dl|? 


/ ?2 l / 
— 4H’ —1Hf— SiIM*P}— = fect AH AH‘ (15.14) 


where we have dropped all fermion terms and all higher terms in the curvature, 
where H = dB and H’ = dB’, and where M = dC and M is self-dual. (In 
this action, we have deliberately neglected the rather subtle point that there 
exists no simple covariant action of a self-dual antisymmetric field.) 

In order to show S-duality, we will make the substitution g,,, > e7?/? 2,., 
which gives us 


S= fax {R —2[ldol? + e**|dl?] — GlMtP — fe* lal? 
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Let us compactify the Type IIB theory ona circle ofradius R jo. [four conjecture 
about M-theory is correct, then this should be equivalent to the compactification 
of M-theory first on a circle of radius R},, and then on a circle of radius Rio. If 
we compare | 1-dimensional supergravity compactified on 5; @ S, with Type 
IIB theory compactified on S,, we find that they are identical, if we make the 
following identification: 


Ri 

=—. 116 
SIIB RG ( ) 
But notice that the 5; @ S, forms a torus, and the modular group of the torus 
is SL(2, Z). In particular, a torus described by R,/Rjo is related to a torus 
described by Rjo/R\,. But this means that the Type IIB theory described by 
coupling constant g is equivalent to the Type IIB theory described by |/g, i.e. 

the theory is se//-dual. This is a nontrivial prediction of M-theory. 
To check this prediction, we first notice that the Type IIB action is indeed 

invariant under an SL(2, R) symmetry given by 


at+b 
: 15.17 
~ ct+d ( ) 
where 
tr=li+ie™, (1 145. Ti) 


where a, b, c,d are real numbers obeying ad — bc = 1, where / is the axion 
field, and where we simultaneously make the transformation on the two-forms 


B eG B 1151 
(a }>(5 a) Ce): iis 


More specifically, we can show this invariance by introducing the matrix: 


li% 2 ) a b 
M =e? , NM = ; 5). 
e ( 4 - ) (11.5.20) 
which transforms under SL(2, R) as 
M=AMA’. (11.5.2 


We can also put H and H’ into the column matrix H, such that 


H r, T\-1 7, ; 
H=| |). Pam (11.5.22) 


Then the Type IIB action (after a simple rescaling) which is manifestly 
invariant under SL(2, R) is given by 


S= f dxJ=e (R= LAL MAM + Leora, MoD). (115.23) 
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where we have dropped the fermionic terms, the self-dual tensor, and the higher 
interactions. 

When this classical symmetry of the lowest order action is quantized, 
SL(2, R) reduces to the subgroup SL(2, Z), so S-duality is given by 


S = SL(Q, Z). (11.5.24) 


What is astonishing is that this S-duality relationship is defined as an abstract 
relationship between fields in the Type IIB theory, but is defined geometrically 
in M-theory over physical space, which shows the power of this formalism. 
If we set / = 0, this SL(2, Z) symmetry, as a subset, contains the important 
invariance 


~—> —-¢. (11e5.25) 


In other words, the strong coupling of Type IIB theory, which is normally 
beyond the realm of perturbative methods, is now revealed to be another Type 
IIB string, as predicted by M-theory. Thus, Type IIB string theory is self-dual. 
This is indeed a surprising result, demonstrating that S-duality can establish 
a relationship between the strong coupling of one string theory with the weak 
coupling of another. We find 


S: IIB < IIB. (11.5.26) 


11.5.3 M-Theory and Type ITB Theory 


We can also establish a direct relationship between M-theory and Type IIB 
theory, although it is an awkward one. Before, we compactified M-theory on 
T, which is equivalent to Type IIA theory compactified on S; (which in turn 
is T-dual to Type IIB theory). The 7) of M-theory compactification, we saw 
earlier, can be parametrized by an SL(2, Z) modular symmetry, which in turn 
helps to give a geometric explanation of the SL(2, Z) symmetry of the Type 
IIB theory. Unfortunately, this relationship between M-theory compactified on 
T, and Type IIB compactified on S| only holds in nine dimensions. In order to 
retrieve the Type IIB theory in 10 dimensions, we need to let Rj go to zero 
(which is equivalent by 7 -duality to letting the radius of compactification of 
Type IIB theory go to infinity). This means that M-theory compactified on a 
torus of zero area and fixed shape is equal to Type IIB theory in 10 dimensions, 
which is a rather awkward relationship. 
If we let 77 symbolically represent this limit, then we have 


M-theory on S$; <_ IIA, 


(UE3.27) 
M-theory on 7, <> IIB. 


Lastly, we may puzzle over several strange facts. First, it appears odd 
that 11-dimensional supergravity cannot reproduce a chiral theory after 
compactification, but M-theory can. In fact, the lack of a chiral interpreta- 
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tion for 11-dimensional supergravity was an important reason for originally 
eliminating it on physical grounds. 

For example, M-theory compactified on 7) reduces to D = 9, N = 2 
supergravity coupled to a tower of KK states plus higher terms. In the limit 
that we let the area of the torus go to zero for fixed shape, we recover the chiral 
supergravity theory. So starting from a nonchiral theory, we wind up with a 
chiral one. Where do the chiral states come from? 

The answer is that M-theory has higher corrections to 11-dimensional su- 
pergravity. In particular, M-theory also contains massive states arising by 
wrapping up a membrane on 7». In the limit that the area of the torus goes 
to zero for fixed shape, we find that these massive modes become massless, 
such that the effective theory reproduces D = 10 chiral Type IIB superstring 
theory. The lesson here is that M-theory contains, in addition to 1 |-dimensional 
supergravity, higher p-brane states which cure the many “diseases” of the for- 
mer theory. These higher p-brane states, in particular, are important in unifying 
all five superstring theories into one. 

A second mystery is related to the fact that the Type IIB string contains 
two background fields given by the tensor fields B and B’. It is well known 
that the Type IIB string acts as a source for the B field, via the coupling 
e'!9;,X"9;X"B,,, and hence has a charge under this field. Type IIB strings 
cannot act as sources for the B’ fields as well. But SL(2. Z) interchanges B 
and B’, so there must be another object, besides the Type IIB string, which has 
a charge under B’. But what is it? Like the Kaluza—Klein modes found when 
compactifying M-theory down to Type ITA theory, we will find that there 
are new BPS states which carry charge under B’ and yield the full SL(2. Z) 
symmetry. 

Before ending our discussion of Type IIB theory, let us comment on the 
factors of @ which appear in the various low-energy effective actions of string 
theory. We recall that the Euler number for a Riemann surface is given by 
Xx = 2-—2g. Fora sphere with no handles, g=0 and the Euler number is 2. The 
path integral over e~* picks up a factor of g~*, which gives us g, ~ for spheres. 
This means that the effective string action defined on the sphere should be 
accompanied by a factor of g-- = e~-*. Indeed, we find this factor for all 
string actions for fields coming from the NS—NS sector. For the Type ITA and 
IIB actions, these terms include the R, ¢, and H fields. 

However, the massless background fields coming from the R-R sector do 
not have any @ dependence at all, since these fields do not directly couple to 
the string world sheet. They cannot couple to the string world sheet via the 
usual way, via products of 0; X". Instead, they couple via spinor bilinears, such 
as SP" §. Tn general, this means that there will be Riemann cuts on the 
world sheet. This also means that the terms in the Type IL actions involving the 
R-R fields do not contain any ¢ dependence. 

Throughout this discussion, we will find that the R-R sector of string 
theory contains highly nontrivial information concerning the nonperturbative 
structure of string theory. 
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11.5.4 Eg ® Ex Heterotic String 


So far, by considering N = 2 superstring theories, we have reduced five 
superstrings down to three. But what about chiral superstring theories with 
N = 1? In particular, what about the Eg ® Eg heterotic string and the Type I 
string? Once again, we find many surprises [20]. 

We begin by compactifying the still-mysterious 11-dimensional theory, 
whose low-energy sector is 1 1-dimensional supergravity, on a finite line seg- 
ment, i.e., we first compactify on S|, giving us Type IIA theory, and then 
NC by a action of Z,. This discrete symmetry acts on 5; by the following: 
x > —X"” 

By dividing by Z, we also reduce the N = 2 symmetry of the Type IIA 
theory by half, down to N = 1. We also wish to keep the bosonic states which 
are even under Z», which include: the 10-dimensional metric g,,,, the scalar 
811.11, and the antisymmetric tensor A,,,,;. Notice that these bosonic fields 
with N = | supersymmetry make up precisely the spectrum of the Es @ Eg 
heterotic string (minus the gauge multiplet). 

The division by the line segment introduces anomalies into the theory, which 
must be canceled because M-theory is anomaly-free. To solve this problem, 
we naturally introduce two additional pieces to the action, each one associated 
with the endpoints of the line segment, which form two hyperplanes. We recall 
that we must have 496 vectors in an N = | theory to kill the anomalies. This 
can be solved by having 248 vectors defined on each hyperplane. We know 
that Eg can give us precisely 248 states, and we also know that Z,. symmetry 
demands that we have equal gauge groups on the ends of the line segment, so 
the entire theory has gauge symmetry Fs ® Eg or SO(32). In other words, we 
can cancel the anomaly by placing a super-£s Yang—Mills theory at each of the 
hyperplanes located at the endpoints of the line segment, so that the resulting 
theory has Eg ® Es symmetry. (At present, precisely how the super-Yang—Mills 
theory emerges from the compactification of M-theory on a line segment is 
still poorly understood.) 

As before, we find that the larger the string coupling constant, the larger the 
separation between these two hyperplanes, i.e., R = g°/”. Thus, to any finite 
order in string perturbation theory, we will never see the direct effects of these 
two hyperplanes. This is the reason why string theory previously missed these 
hyperplanes and the nonperturbative structure of the heterotic string. 

We summarize this symbolically by stating: 


S : M-theory on S|/Z) <> Eg ® Es. (1628) 


11.5.5 Type I Strings 


Now we have reduced the five superstrings down to two. We are still left with 
Type I strings, which, unlike the others, have both open and closed strings. 
The last correspondence is obtained by comparing Type I and SO(32) heterotic 
strings, which both have SO(32) symmetry. Although they realize the SO(32) 
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symmetry in entirely different ways (Type I strings via Chan—Paton factors, 
and SO(32) strings via the Frenkel-Kac construction), they have identical 
low-energy sectors. We suspect that they may be S-dual [21-22]. 

As before, let us now make a prediction about this relationship from M- 
theory. Let us first compactify M-theory on S,/Z. with a length given by L. 
We arrive at the £3 Es heterotic string. And now let us compactify again on $;, 
with radius given by R, which breaks the symmetry down to SO(16)@SO(16). 
We have then compactified M-theory on a cylinder, with radius R and length 
ii 

Now let us compare this resulting theory with Type I and the SO(32) theories 
compactified on a circle of radius R. The effective low-energy bosonic Type I 
action is given by 


Y= [axxv= {e~** [R + 4|do|’] — e* Tr JF? — $|H'?}. 
(11.5.29) 
Notice that the @ terms appearing in the Type I effective low-energy action 
have the correct character. The R and d@ terms appear multiplied by e~*¢, 
as expected, since they are defined on the sphere. (However, the H = dB 
field is missing.) The F term is accompanied by a factor of e ® because the 
Yang—Mills terms are associated with the open string sector, which is defined 
on the disk (with Euler number 1) rather than the sphere. And the H’ term 
comes from the R-R sector, and hence has no ¢ dependence. So all factors in 
the Type I action have the correct world sheet structure and @ dependence. 
The SO(32) heterotic action is given by 


s= f dx /=ge* [R+4ldgl— 4A -—a’TriFl}.  (11.5.30) 


Notice that this action has the correct @ dependence. Since the heterotic string 
has no R-R sector, we find that the entire SO(32) action is multiplied by a 
factor of e~*?, as expected. 

Now consider M-theory, compactified on either [ $,/Z>]@S, or §;@[S,/Z>). 
i.e., we can always reverse the order of compactification. 

In the first method, we compactify first on S;/Z> and arrive at the Es ® Ey 
theory. Then we compactify on S,, which gives us an SO(16)QSO(16) heterotic 
theory. This, in turn, is 7-dual to the SO(32) theory compactified on S). 

In the second method, we reverse the order. We first compactify first on 
S|, which gives us Type IIA theory. Then if we compactity again on S$; /Z>, 
we break N = 2 supersymmetry down to N = |. By carefully analyzing the 
structure of this N | theory, we find that it is a 7-dual version of Type I 
theory, with group structure SO(16)® SO( 16). Since both theories must be the 
same, we therefore have a relationship between Type I theories and heterotic 
strings 


S: SOB2) <1. SS) 
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Comparing these two theories (which were derived by simply reversing the 
order of the compactification of M-theory) we find 


R 
81 = 1/8s0G2) = a: (11.5.32) 


In the previous case of M-theory compactified on a torus, we found that 
the modular symmetry of the torus could be used to show the equivalence of 
Type IIB theory with coupling constant g with Type IJB theory with coupling 
constant 1/g. This gave us the self-duality of the Type IIB theory. However, 
in the case of M-theory compactified on the cylinder, there is no counterpart 
of the modular symmetry of the torus, so we do not expect Type I theory to 
be self-dual. However, what we find is that Type I theory is dual to SO(32) 
heterotic strings. 

More precisely, we have the following transformation which converts the 
Type I theory into the SO(32) theory: 


Suv > enter, 

¢— -9¢, 

B' > B, 

A= a’ A. (1.5.33) 


We can also find a relationship between M-theory and SO(32) theory in 
10 dimensions, although it is rather artificial. Like the relationship between 
M-theory and Type HB theory, we have to take a rather awkward limit. We 
start with the fact that M-theory compactified on a cylinder, 1.e., [S; /Z2] @ $), 
is equal to SO(32) string theory on S$). To retrieve the SO(32) theory in 10 
dimensions, we have to take the zero area limit for fixed shape. So we have 


M-theory on S| /Zp ~ Ex, ® Es, 
M-theory on Tj /Z2 < SO(32). (11.5.34) 


In addition, M-theory sheds some light on the reason why Type I theory 
contains both open and closed strings. The Type I closed string theory, we 
recall, can be viewed as a truncation of the Type IIB closed string. We note 
that the Type IIB closed string theory, because it contains left and right movers 
of the same chirality, is invariant under the world sheet parity operation 2, 
which exchanges left and right movers, and sends o into —o. We can therefore 
project out of the Type IIB theory the subsector which is parity even under &2. 
This means that the Type I closed string is an “orientifold” of the Type IIB 
theory. This truncation gives us the N = | unoriented Type I closed string. 
However, the two-form B,,,, found in the Type II theory couples to the string 
via e'/ 0; X" 0; X", which has odd parity under the world sheet parity operation 
@, and is hence projected out. Thus, 8,,,, cannot couple to the Type I closed 
string. Because Type II strings contain this term e/d;X"0;X"B,,,, it means 
that Type II strings have a conserved charge under this two-form, 1.c., the Type 
II strings act as sources for the two-form field, as in ordinary gauge theory. 
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However, the absence of B,,,, for the Type I theory means that Type I closed 
strings do not have a conserved charge, i.e., they can break into open strings. 
This is the reason, in fact, why the Type I theory closed strings are unstable, 
because they do not have a conserved charge under B,,,. 

(In the same way that the electric charge of a point particle is equal to the 
surface integral over a sphere of the dual of the Maxwell tensor, the charge 
associated with the string is equal to the surface integral of the dual of the field 
tensor over a seven-sphere. If a closed string were to break, then the seven- 
sphere could slide off the string, and the surface integral would equal zero. 
For Type II strings, where the charge is conserved, the charge cannot suddenly 
equal zero, and hence the Type II closed string cannot break. However, this is 
not true for the Type I closed string, which has no such conserved charge, and 
it can break.) 

Lastly, there is one more symmetry, called U-duality, where a theory with 
large radii and strong coupling is dual to a theory with small radii and weak 
coupling. Summarizing all three, we have 


S-duality; ¢— —¢, 
T-duality: > —w, (1.5.35) 
U-duality: w—> +¢, 


where w is a scalar field whose vacuum expectation value controls the radius 
of compactification. In the literature, U-duality [23] also sometimes refers to 
the minimal group which contains both S- and T -dualities, 1.e., 


USS@ T. (11.5.36) 


We will discuss U-duality more when we analyze BPS saturated states. 


11.6 Summary 


Let us summarize our results so far. We have found that S$, 7, U dualities 
provide a powerful way in which to analyze the perturbative and nonperturba- 
tive structure of string theory. In particular, they show that all five superstring 
theories are really the same theory, and that they may ultimately be unified 
into a theory called M-theory. Although the action for M-theory is only known 
to the lowest level (where it reduces down to | 1-dimensional supergravity), 
the theory has already given us unparalleled insight into the nonperturbative 
behavior of string theory in various dimensions. 

Our discussion began with 7-duality, which is based on the fact that the 
momenta of the compactified closed string is given by 


n n 
(Pi. Pr) = (55 +mR, ee (11.6.1) 


where the n, as usual, arises from the Kaluza—Klein excitations of the circle, 
but m labels the number of times the string winds around the circle. Notice 


11.6 Summary 477 


that the mass spectrum for M7? is invariant under 


1 
Re —, 6). 
a GirG.2) 
when we interchange n <— m. 
Rewritten in the language of super conformal field theory, this transforma- 
tion is equivalent to 


0X > 0X, 

ax => —dXx, 

ve Wr, 

Wr—> Wr. (11.6.3) 


This, in turn, reverses the sign of the chirality operator on one set of states, and 
hence changes the chirality of a Type IIA theory into a Type IIB theory. Thus, 
we have the result that 


TAB: (11.6.4) 


So five superstring theories have been reduced down to four. Similarly, we 
can show that the two heterotic strings are T-dual. Naively, the moduli space 
of vacua for the heterotic string compactified down to d dimensions is given 
by SO(26 — d, 10 — d). We arrive at this result by analyzing the one-loop 
amplitudes and demanding that they be modular invariant. 

However, this is too large. We can still divide out by SO(26—d) and SO(10— 
d), since the mass operator is separately invariant under these groups. Lastly, 
we can divide out by the 7 -duality group SO(26 —d, 10—d, Z) which merely 
shuffles the lattice points into each other. This 7-duality group shuffles the 
Es ® Eg theory into the SO(32) theory. Then the full moduli space is given by: 


BC(26—d,10—d) 


Ba a i (11.6.5) 
SO(26 — d) ® SO(10 — d) @T 


moduli space = 


where 
T = SO(26 — d, 10 —d, Z). (11.6.6) 


Then five superstring theories have been reduced down to three. Although 
these T-dualities were interesting when they were first discovered, the real 
impact was made when the S-dualities were first discovered. Although the S- 
dualities have not been proven rigorously, they are believed to be true because 
of many independent checks. 

We start by reanalyzing the 1 1-dimensional supergravity action 


] 
S=—> dx {/—g[R+41FV] 


It 


eM M)...M\| Fy... LM A Ma Mig Ms A (1 1.6.7) 


(72) 
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where F'y,...y, is the field tensor constructed out of antisymmetrized derivatives 
of Am,.m.Ms- 

It was noticed years ago that this action, when compactified on a circle 
of vanishing radius, reduced down the Type IIA supergravity action. After 
compactification of the 11-dimensional theory, we find the correspondence 
with the 10-dimensional Type IIA supergravity metric 


ds? = gyy dx™ dx™ = e779, dx" dy” 
+ et? (dy — dx"C,)° ; 
A = tdx" Adx” Adx? Ayy, + 5dx" Adx"dyByy. (11.6.8) 
Let K =dC, H =dB,and G = dA. Then we have the Type IIA action 


S= [ars {/=ge **[R + 4d)? — 5|A|’] 


— J=g (IK? + gIGl]}}+ aa / GAGAB. (11.6.9) 

Although this result was known for many years, it was considered a curiosity 
for many reasons. First, | 1-dimensional supergravity is a sick theory. Second, 
this result was valid only for Type IIA supergravity, not the full superstring 
theory. Third, there were Kaluza—Klein states arising from the compactification 
that had no counterpart in either Type IIA supergravity or superstrings. 

All these problems can be solved if we introduce M-theory. First, the sick- 
nesses of 1 1-dimensional supergravity can be solved if we consider it to be the 
low-energy sector of a higher theory. Second, the complete Type IIA supergrav- 
ity action should arise if we compactify the higher terms of the | 1-dimensiona] 
theory. (We can, in fact, work backward and define the higher terms of 1|1- 
dimensional supergravity as emerging from Type IIA superstring theory.) And 
lastly, the Kaluza—Klein states of the compactified | 1-dimensional theory can 
be viewed as BPS saturated states of the superstring. 

Similarly, we can derive the Ey ® Ex heterotic string by compactifying M- 
theory on a line segment. Then an Eg symmetry emerges at each end of the 
line segment, giving rise to Ey ® Eg symmetry. (Ironically, | 1-dimensional 
supergravity was, in part, abandoned because it could be shown that the theory 
could not produce a chiral theory after compactification on a manifold. Now, 
we see that |1-dimensional supergravity can, in fact. vield chiral theories by 
compactifying it on spaces which are not manifolds.) 

When we compare the two actions, we find the correspondence 


Rye). (11.6.10) 


So we sec that the strong coupling limit of 10-dimensional Type ITA superstring 
theory yields the 1 1-dimensional M-theory. 

Other dualities can be found as well. We first notice that the Type IIB theory 
is self-dual under the discrete group SL(2. R). In addition to the tensor field 
B,. arising from the Neveu Schwarz sector, we also have the B’,,,. tensor 
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field coming from the Ramond—Ramond sector. These two tensor fields can 
be combined into a single column vector. 
To see this, we introduce: 


a} b 
M=e(" ak sol ok (11.6.11) 


which transforms under SL(2, R) as 


M=AMA’. (11.6.12) 

We can also put H = dB and H’ = dB’ into the column matrix H, such that 
; H , ae 

i ae Hae (Kh). (11.6.13) 


Then the Type IIB action (after a simple rescaling) which is manifestly 
invariant under SL(2, R) is given by 


bp 


s= f a%xV=2 (R- pH MH"? + iT" Ma,M~)), (11.6.14) 


where we have dropped the fermionic terms, the self-dual tensor, and the higher 
interactions. 

We also notice that SO(32) heterotic strings and Type I strings have the same 
perturbative gauge group. The effective low-energy bosonic Type I action is 
given by 


c= [ax fe? [R + 4|db|?] — e? Tr| FP? — 3]H'?}. (1.6.15) 
The SO(32) heterotic action is given by 
c= [atxs=e0 [R+4id¢) — 4H)? —a' TrlFP]. — (11.6.16) 


Comparing these two theories compactified on S, with M-theory compactified 
on S;/Z, and then by $;, we find 


R 
81 = 1/8s032) = 7 (11.6.17) 


In summary, when compactified on a circle S,, we found the following 
T -dualities in nine dimensions: 
- Type IA < Type IIB, 


(11.6.18) 


This, in turn, reduces the various superstring theories down to two categorics, 
those with N = 2 supersymmetry (the Type II theories) and those with N = | 
(the heterotic theories). 
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We also found the following S-duality relations in 10 dimensions: 


€ Type IB < Type IIB, 
, Type I = S0(32). 
The last relation is particularly important because it links the N = | 


superstring theories with the N = 2 theories. 
When going from 11 to 10 dimensions, we found the S-dualities: 


M-theory on S, < Type IIA, 
M-theory on T; <> Type IIB, 
M-theory on $|/Z. <> E3 @ Es, 
M-theory on T7/Z,. <> SO(32). 
Lastly, although these duality relations are quite impressive, we may still 
challenge many of these results because of their lack of rigor. We have only es- 
tablished the S-duality transformation to lowest order in the effective theories. 
Although this is still an unsolved problem, we have confidence in these results 
because of a number of self-consistency checks we can make, the most com- 
pelling being the analysis of BPS saturated states and p-branes. We now turn 
to these topics. 


(11.6.19) 


(11.6.20) 
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CHAPTER wiz 


Compactifications 
and BPS States 


12.1 BPS States 


As remarkable as these dualities may appear, we can still question whether 
they are accidents of our low-energy expansions or represent something deeper 
about the theory. We must establish other consistency checks on these nonper- 
turbative results. Perhaps the most persuasive evidence comes from something 
called “BPS saturated states,’ which allow us to probe nonperturbative phe- 
nomena because of the nonrenormalization theorems. In this chapter, we will 
see show these BPS saturated states allow us to establish a web of dualities in 
dimensions down to D = 4. 

To begin, let us analyze one of the simplest examples of a supersymmetric 
field theory in 1+1 dimensions. We start with the Lagrangian 


L = 30,6) + 5¥y".v -3V°0)—AV'@WY, (121.1) 


where V(¢) = A(d? — a“) and w is a Majorana fermion. This theory is 
supersymmetric, with two chiral supercharges given by 


oe / dx(b+ oa F VO)Ve, (12.1.2) 


where y; are the left- and nght-handed components of y. Then it is easy to 
write the supersymmetry algebra 


Oe 7 hagite 
Q? =P., 
(O.. 0 l=. (1213) 
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where P+ = Py + P,, and T is the central charge. From the algebra, we find 
the relationship 


eee, +O FO, Oy +T. (12,14) 


Now comes the key point. Since the square of the supercharges is always 
positive, this means that we can sandwich this relationship between eigenstates 
of Po, where the particle is at rest, and we find 


M > 3T. (25) 


This is the Bogomol’nyi bound, which establishes a relationship between the 
masses and charges of these states. In this chapter, we will be interested in 
those states which saturate this bound, i.e., for which the inequality becomes 
an equality. We find the equality is satisfied when we sandwich this equation 
between two states |s) which obey (Q, + Q_)|s) = 0,i.e., for states which 
are annihilated under precisely half of the supercharges. These are the BPS 
saturated states. 

This simple example in | + 1 dimensions illustrates the main features which 
we will also find in string theory. We will find that BPS saturated states for 
strings preserve half of the supersymmetry generators, and most important, 
we believe that the BPS relationship is preserved to all orders in perturbation 
theory due to the supersymmetry nonrenormalization theorems. 

Now let us generalize these results by considering the super translation 
algebra in four dimensions 


(Q!,, OF} = 84 (y"C)ag Py + UY (Cag + VA(Cys)ap (12.1.6) 


fori = 1,2,...N, whichcontains N(N — 1) central charges contained within 
the antisymmetric matrices U and V. In particular, for N = 4 and N = 8 
superalgebras, we have 12 and 56 central charges, respectively. Historically, 
these central charges were largely ignored, since they describe massive states 
and higher N theories. But for our discussion, they will be crucial in finding 
the BPS saturated states. 

As before, we can sandwich this algebra between eigenstates of Po, where 
we assume that the state is at rest. Then the expectation value of Po yields the 
mass terfn M. This algebra now reduces to 


M > kiZ| (12.1.7) 


for some constant k, where |Z| symbolically represents the charges that we 
obtain from U and V. In other words, the masses and charges are now related by 
this condition. This result gives us a powerful relationship between the masses 
of a supersymmetric theory and the charges of those states. For the states which 
saturate this relation, it is believed that supersymmetry protects them from 
being renormalized by quantum effects, so the relation holds nonperturbatively. 
This is the key reason why they play such an important role in establishing 
the validity of duality. The point here is that the BPS relationship is probably 
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satisfied nonperturbatively because of nonrenormalization theorems, giving us 
a powerful constraint on the properties of dual theories. 


12.2 Supersymmetry and P-Branes 


Now generalize this relation for the arbitrary case. As we will see, the presence 
of p-branes alters the relation to 


(COMORE=IR CG), Fe (PPG) SZ 2245 
Pp 


In the original work of Haag, Lopuszanski, and Sohnius, which categorized 
the representations of supersymmetry algebras, these higher central terms were 
dropped because they implicitly assumed only the existence of point particles. 
But if we allow higher p-branes to exist, then we necessarily have to add these 
higher terms to the super translation algebra. 

Previously, there were serious objections to higher p-branes. For example, 
the 11-dimensional supermembrane (2-brane) theory was rejected for three 
reasons: 


(a) it was unstable; 
(b) its three-dimensional action had ultraviolet divergences; and 
(c) even its free action was unsolvable; it was quartic and highly nonlinear. 


But M-theory forces us to reexamine this from an entirely new light. We saw 
before that M-theory can be represented, to lowest order, as | 1-dimensional 
supergravity. To higher orders, however, the action is basically unknown. In 
11 dimensions, the only known generalization of | 1-dimensional supergravity 
is the supermembrane, which might provide a clue as to the structure of M- 
theory. In particular, under a double-compactification (reducing one dimension 
of space-time and one dimension of the world volume), the 1 1-dimensional 
supermembrane reduces down to the usual Green—Schwarz string. 

To begin our discussion of membranes, consider an ordinary gauge theory 
coupled to a point particle source (i.e., a zero-brane). 

The coupling is given by 


farxayi (1222 
where the source j” is given by: 
j(xy= fac 5° (x, — X,(t)) 0-X*(r), (12.2.3) 


where X(t) labels the location of a point particle. This reproduces the familiar 
formalism of Maxwell’s theory of point particles. 
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Likewise, for a string (a 1-brane), the string variable X » couples to a 
massless, background second-rank tensor B,,, as follows: 


. i OSs) sada. (12.2.4) 
where 
j’'@)= il dt do 8” (x, — X,(t, 0)) €48;X"8;X”. (12.25) 


Inserting the current into the coupling, we find the usual term coupling the 
string to the massless background field 


[ aedorea,x"9)x° By (122-6) 


Similarly, this can be generalized to describe a gauge theory with a massless, 
background p + | rank antisymmetric gauge potential coupled to a p-brane. 
We introduce the p-brane variable X,,(0;), where o; label the coordinates of 
the world volume swept out by the moving p-brane. 

The coupling is given by 


/ deAV ee ae ee 


where 


eee el [artes Gr = X,,(0;)) Ela 0), Xe Ojmmeke 


(12.2.8) 
Now construct the field tensor associated with the p-brane potential: 


Fie op Ones + permutations: (12.2.9) 


To construct the dual theory, we introduce another field tensor F’ such that 
it equals the dual to F: 


(12.2.10) 


* — , 
Friniieas =F M1 ++-f4g+2° 


The left-hand side is a D — (p +2) rank tensor. The right-hand side is a g + 2 
rank tensor. Since these two numbers must be equal, we therefore have the 
relation 


p+q=D-4, (12.2.11) 


which gives us the dimension q of the dual of a p-brane. For example, in four 
dimensions, the dual of an electron (0-brane) is another 0-brane, the monopole. 
In 10 dimensions, the dual of a string is a S-brane. Likewise, in | 1-dimensions, 
the dual of a membrane is a 5-brane. 
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Lastly, we can construct the electric and magnetic charges corresponding to 
these p-branes. We find 
Of | ie 
Sp—p-2 


Qu ~ f F, (2212) 

Sp42 
where we have encircled the p-brane by a hypersphere. (The definition of 
electric versus magnetic charge, as we see here, is a bit arbitrary, since we can 
reverse them by taking the dual.) 

One simple guiding principle which determines which p-branes can exist 
is to analyze the central charges appearing in the supersymmetry algebra. 
Surprisingly, the supersymmetric algebra allows us predict which p-branes 
appear in which dimensions, even before performing any explicit calculation. 

For example, in Green—Schwarz superstring theory the action is invariant 
only up to a total derivative. Normally, this total derivative is dropped. However, 
if we calculate the currents corresponding to supersymmetry, we find that the 
algebra of the currents do not close in the usual fashion. In particular, this 
total derivative term contributes precisely the central charges found in the 
super translation algebra. By an explicit calculation, we find that the central 
charge contribution ofa p-brane is given precisely by a pth rank antisymmetric 
field Z,,,..,., appearing in the algebra. This is the key point: by examining the 
central charges Z,,,__,,, in the algebra, we can read off the spectra of p-branes 
in the theory. This is quite remarkable, because we now know the number of 
nonperturabative p-branes in the theory without ever having to construct them! 

Now consider the fact that the largest number of components of the fermionic 
generators that can appear in the superalgebra is 32. In 11 dimensions, we 
can construct Majorana spinors with 32 components. There should be 528 = 
(33 x 32/2) possible terms on the left- and right-hand side of the algebra. The 
algebra, including the central terms, is given by 


{Qa, Og} = (EG) Eye (ee) ZMN 
wy era Z\ V POR: ( eel) 


Each central charge term on the right-hand side corresponds to a p-brane. 
Counting states, we find that they contribute 


Ll + 55-462 528 states (12.2.14) 


as expected. Thus, we expect to find a membrane and its dual, a S-brane in 11 
dimensions. 

As a check on this formalism, we note that this membrane must couple to a 
third-rank tensor. If we examine the currents of | 1-dimensional supergravity, 
we have a third-rank tensor field A yy p, which couples to a 2-brane as expected. 

Now reduce the algebra down to 10 dimensions, in which case a 32 
component spinor will become two Majorana—Weyl spinors, each with 16 
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components. We now have two spinors Q!,, one for each chirality. If they have 
opposite chiralities in 10 dimensions, then they can be recombined into a single 
spinor, giving us the algebra for the Type IIA superstring: 


{Qa, Op} = (T"C),, Pu + TPuC)apZ 
M 
Geet C). . Zuw 
+ ee elne)., ZMNPO 
+ (Pe e*C) ap ZMNPOR- (12215) 
Counting states, this reduces to 
10+ 1+ 104 45 + 210 + 252 = 528 states (12.2.16) 


as expected. These, in turn, correspond to p = 0, 1, 2, 4, 5 branes for the 
Type IIA theory. Notice that the 5-brane of the | 1-dimensional theory reduces 
to the 4-brane and 5-brane of the 10-dimensional theory, and that the 2-brane 
of the 11-dimensional theory reduces to the 1-brane and 2-brane of the 10- 
dimensional theory. (Notice that Py of the 11-dimensional theory splits off 
into a scalar P,; and a vector in the 10-dimensional theory. Viewed from 
the vantage point of the eleventh dimension, we see that they correspond to 
Kaluza—Klein modes. However, viewed from the tenth dimension, they are 0- 
branes corresponding to soliton-like states coming from the R-R sector. This 
is the resolution of the puzzle mentioned earlier, that Kaluza-Klein modes 
coming from | 1-dimensional supergravity compactified on a circle correspond 
to solitonlike excitations of the Type ITA 10-dimensional string.) 

In summary, if we let (p;, p2) represent dual p-branes carrying (elec- 
tric, magnetic) charges, respectively, then the Type IIA electric and magnetic 
charges can be arranged according to 


R-R: (0,6), 
RR: (2,4), (122.17) 
NS-NS: (1, 5). 


Similarly, for the Type IIB string, we have two chiral spinors Q/,, where the 
algebra is given by 


(OO nC) Py (PI"C),, Zz 
+ 6 (Cae)... Zunp +8" (fi 26). ZMNPOR 
+ (PIM"PORC) , Zunpor: (12.2.18) 


where P is a chiral projection operator, and the tilde refers to traceless 
symmetric SO(2) tensors. Counting states, this gives us 


10+ (2 x 10) + 120+ 126+ (2 x 126) = 528 states (122.19) 
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as expected. This, in turn, corresponds to p = 1,3, 5 (and also p = —1, which 
corresponds to an instanton). 

Comparing this with the antisymmetric fields of the Type IIB supergravity, 
we find two second-rank tensors, one from the NS—NS sector, the other from 
the R-R sector. 

In summary, we find that the Type IIA theory must have even Dirichlet 
p-branes, while the Type IIB theory must have odd Dirichlet p-branes. By 
dimensionally reducing the various charges Z,,,.,,,, we can also deduce the 
various p-branes appearing in lower dimensions as well. It is remarkable that 
this classification for various dimensions can all be done at the group-theoretic 
level without constructing the p-branes. 


12.3. Compactification 


Now that we have assembled the machinery of BPS saturated states, let us use 
these results to discuss compactifications for D < 8. As we have seen in 9 
and 10-dimensions, the nonperturbative dual of a superstring is often another 
superstring. We find that a similar situation happens in lower compactified 
dimensions. 

In order to systematically compare the various strings and p-branes in 
various dimensions, it is necessary to calculate the number of supersymme- 
tries which survive the compactification process. To do this, we introduce the 
concept of holonomies. 

After compactification, we wish to find now many spinors can satisfy the 
constraint 


De= 0, (12.321) 


For each covariantly constant spinor which satisfies this relationship, we 
have one generator of supersymmetry which survives the compactification 
process. 

We recall that the group generated by successive commutations of D, is 
called the holonomy group. Once we know the holonomy group after com- 
pactification, we also know how many supersymmetries have survived the 
compactification process. This counting is especially easy for toroidal com- 
pactification, where the holonomy is equal to one, and all supersymmetry 
generators survive the compactification. 

To see how this toroidal compactification takes place, we wish to rear- 
range the 32 fermions contained within Q, in 10-dimensions into Q', in lower 
dimensions, where i counts the number of supersymmetries. 

In eight dimensions, a Majorana spinor is not possible, but a Weyl spinor 
is possible with eight complex components, or 16 real components. These 16 
components can be rearranged into Q' so we have N = | for the heterotic 
string and N = 2 for the Type II strings. 
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In six dimensions, a Majorana spinor is again not possible, but a Weyl spinor 
is possible with four complex components, or eight real components, so we 
have 8 x 2 = 16, i.e., N = 2 is the maximum number of supersymmetries for 
the heterotic string, and also 8 x 4 = 32, i.e., N = 4 is the maximum number 
of supersymmetries for the Type II strings. 

In four dimensions, a Majorana or Weyl spinor has four components, so 
N = 4 for the heterotic string and N = 8 for the Type II strings. 

We can summarize some of this in the following table, where D represents 
the number of uncompactified dimensions of space-time, S is the minimal 
dimension of a spinor in D dimensions, and N represents the number of 
supersymmetry generators after compactification on torii. 


PTS ens [a 
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Instead of compactifying on torii, consider the complex manifold K3. It is a 
Ricci flat, Kahler manifold with SU(2) holonomy, and hence it is a Calabi-Yau 
manifold but in only two complex dimensions. It therefore breaks just half the 
supersymmetry generators. (For example, while toroidal compactification of 
Type IIA on 7, gives us N = 4 in six dimensions, K3 compactification only 
gives us N = 2 in six dimensions.) 

For the heterotic string after compactification on K3, this compactification 
preserves only half the remaining supersymmetries, so we have N = | in six 
dimensions. 

This can be summarized in this table. Let D be the number of uncompacti- 
fied dimensions, and let N be the number of supersymmetry generators after 


(12.3.2) 
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successive compactification on Calabi-Yau manifolds with SU(2) holonomy, 
with two complex dimensions (as in K3) and three complex dimensions (e.g., 


K; @ 7). 
Heterotic | Type II 


Be 
res Nia 


sal 


Compactification on a Calabi-Yau manifold with SU(3) holonomy yields 
N = | in four dimensions for the heterotic string, as desired for physical 
reasons. It preserves only one-fourth of the supersymmetries. 

Furthermore, the holonomy group of a seven-dimensional Joyce manifold is 
G». This can take us from !1-dimensional M-theory down to N = |. D = 4, 
which is physically relevant. 

As a first step toward constructing new dualities in various dimensions. 
we must make sure that the number of supersymmetry generators N of two 
seemingly different theories matches. For example, we see that a possible 
candidate is Type II string theory compactified on K, being dual to heterotic 
string theory compactified on 7,. We will see that this conjecture is satisfied. 


(1238) 


12.4" Example: 26 


There are several ways in which to construct these dualities in lower 
dimensions, such as: 


e matching the low-energy supergravity theories between two. string 
theories; 

e comparing the moduli spaces of possible vacua of two string theories: 

e matching the BPS states and spectra of the two string theories: and 

e for chiral string theorics, demanding that all anomalies vanish. 


But the simplest way is to exploit the dualities of 1 !-dimensional M-theory. 
As we will see shortly, M-theory contains not only membranes, but also 5- 
branes as well. We will see that the origin of many of the dualities in lower 
dimensions can be traced to the original 11-dimensional duality between the 
membrane and the 5-brane. By sequentially compactifving both the membrane 
and the 5-brane on the same manitolds, we arrive at two seemingly different 
theories which are actually duals of each other. In this way, we should be able 
to derive a wealth of dualities. 


12.4 Example: D = 6 491 


Let us now analyze various dualities that exist in D = 6. For example, if we 
wrap the membrane around a one-dimensional space we call M,, which can be 
either a circle $, or a line segment S$) /Z», we arrive at a string theory (either 
Type IIA theory or the Es ® Eg heterotic string). Then we can compactify this 
further on some four-dimensional manifold, M4, and arrive at a six-dimensional 
string theory. 

Similarly, we can reverse this order. We can wrap the 5-brane around Mg, 
thereby producing a string theory (wrapping a p-brane around a g dimensional 
manifold can result in a (p — q)-brane). And then we can compactify this string 
on M,, thereby producing another string theory in six-dimensions. By duality, 
these two string theories should be dual to each other, one being fundamental 
and the other being solitonic. 

Let us take M,; = S, or S|/Z.2, and My = T, or K3. Then we arrive at 
four possible string theories by compactifying the membrane on M; ® Mg, 
which should be dual to the four string theories we obtain by compactifying 
the 5-brane on M, @ M,. 

If we wrap the membrane on S, and S|/Z, we obtain Type IITA and the 
E,® Eg heterotic strings, respectively. Similarly, if we wrap the 5-brane around 
M4 = T4 and K3, we obtain Type ITA and the heterotic string, respectively. 

We can then place the various D = 6 dualities on a chart [1—2]: 


(Ne N=) M, M, Fundamental String Dual String 


C0) S\/Z, Ky; heterotic heterotic 
(4) Ay K; Type ITA heterotic 
x1) S\/Z, Ts heterotic Type ITA 
(25,2) AY T;, Type IA Type ITA 


where Nz represents the number of chiral supersymmetries that survive in four 
dimensions. 


2.42) \D — 6, N =, 2) Theory 


Now let us examine the dualities among compactified string theories in more 
detail. Because there are a bewildering number of duality groups that enter into 
the compactified theory, it will be useful to use the old supergravity formalism 
as a guide. If we compactify the 1 1-dimensional supergravity action, we find 
that the effective, low-energy action is generically given by 


L = J=8 (R — 4 91(0)8.6/0"G! — my) P! Fry too), (12.4.1) 


where ¢! are various scalar fields which take values in a target space M 
with metric g;;(¢), and F/,,,, represent the abelian field strengths. We will be 
interested in the equations of motion for the scalar fields ¢', which are invariant 
under some symmetry group G, such that M is the homogeneous space G/H, 
where H is the maximal compact subgroup of G. 

Over the years, the physical significance of the group G appearing in su- 
pergravity theory was something of a mystery. Nonetheless, G was carefully 
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cataloged for various dimensions and supersymmetries. For example, we find 
the following table for supergravity in various dimensions coupled to super- 
Yang-Mills fields, which forms the low-energy sector of the heterotic string 
theory [3] 


O(6, 22) x SLQ, R) | OG. 22, Z) | Of6p22, 7) 2 Beez) | 
B O(8, 24)? O(8, 24, Z) O(s. 24)" Z) 


(12.4.2) 
where D represents the number of uncompactified dimensions and G is the 
symmetry group of the equations of motion for the scalar fields. (The super- 
script (1) appearing for the heterotic string compactified down to D = 2 refers 
to the affine group.) 

Notice that once we make the transition from compactified supergravity to 
the quantum heterotic string theory, we find that the group G is modified in 
a rather straightforward way to become the 7-duality and U-duality group 
of string theory. In this way, we find that the duality groups of superstring 
theory provide a convenient explanation for the group G, which emerged rather 
mysteriously in supergravity theory. This process is also aided by the fact that 
there are no R-R fields in the heterotic theory, which simplifies the counting 
when we compactify the theory. This means that the compactification of the 
various boson fields yields the massless scalar fields ¢' directly. 

The situation with the Type II theories, however, is different, as we see in 
this table, where the group G for the supergravity theory is given for various 
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dimensions D [3]; 


Ee Supergravity Group G 
Pp fsemorn [x [aan 
a [Remsen [oonn|wanewen 
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(12.4.3) 

Here, we find many R-R fields contributing to the massless spectrum, which 
all appear in the low-energy supergravity theory, and hence the theory is more 
difficult to analyze. Nevertheless, we can use the Type II supergravity theories 
as guides in which to investigate the T- and U-duality groups. 

For example, consider the D = 6, N = (2, 2) theory, which we obtain by 
compactifying the membrane on 7; (or by compactifying the Type IIA theory 
on 7,). Looking at the theory in different ways yields powerful checks on the 
consistency of the theory. By compactifying the Type IIA theory on 74, we 
obtain the T-duality group SO(4, 4, Z). But the compactification of M-theory 
by Ts yields the toroidal modular group SL(5, Z). To reconcile these two facts, 
we choose the smallest group which contains both as subgroups, which is 
SO(5, 5, Z). This, in turn, is consistent with the SO(5, 5, Z) U-duality sym- 
metry that we obtain by analyzing | 1-dimensional supergravity compactified 
down to six dimensions. 

We can also use this discussion to generalize this result for other dimensions. 
If we compactify 1 1-dimensional supergravity on a (c + | )-dimensional torus, 
with c > 5, then the moduli space of the torus is given by SL(c + 1, Z). But if 
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we compactify the 10-dimensional Type IIA string on a c dimensional torus, 
then the T -duality group is SO(c, c, Z). Again, to reconcile these two facts, we 
observe that they are subgroups of the noncompact form of E..,;(Z), which is 
the U-duality group, which is a symmetry which combines both perturbative 
and nonperturbative states [3]. Thus, we find that 


Uh Cc (12.4.4) 


In this way, we can use the low-energy sector of M-theory, 1 1-dimensional 
supergravity, as a guide in which to construct the string duality groups in 
various dimensions. 


1242 D=6, Ne, 1) Theories 


Now let us analyze the dualities of the D = 6, N = (1, 1) theories. For exam- 
ple, membrane/5-brane duality yields the fact that the IIA string compactified 
on K; is dual to the heterotic string compactified on 7%. 

If we compare the two theories, we find that they have the same low energy 
limit: D = 6, N = 2 supergravity coupled to 20 abelian super- Yang—Mills 
multiplets. Since the six-dimensional super- Yang—Mills multiplet contains four 
scalar fields, we have 4 x 20 = 80 scalar fields altogether, which span the 
moduli of the possible vacua. In addition, there are four vector fields con- 
tained within the supergravity multiplet, so there are 20 + 4 vector U(1) fields 
altogether. 

Now let us analyze the spectra of the compactified heterotic and Type IIA 
theories. We recall that the heterotic string compactified down to six dimensions 
via T, has a moduli space of vacua given by 

SO(4, 20) 
SO(4) © SO(20) ® T 
which is 4 x 20 = 80 dimensional, as expected. The six-dimensional theory 
also has 80 scalar fields. To see this, we note that the metric g,,,. and B,,,. yield 
4 x 4 = 16 scalar fields after compactifying four dimensions. Similarly, the 
Es @ Eg Yang-Mills field A%, gives us 4 x 16 = 64 scalar fields. The sum 
yields 80 scalar fields, as expected. 
We have therefore the following scalar states: 


(12.4.5) 


Say: \Ovsealans: 
Biv: 6 sealars, (12.4.6) 
an : 64 scalars. 


Also, Af, yields 16 vector fields. When we combine this with eight vector 
fields contained within g,,,, and B,,,., we wind up with 24 U(1) vector fields, as 
expected. 

We also note that the N = | heterotic string has Q, with 16 components. In 
six dimensions, the Weyl spinor has eight components, so it can be reassembled 
into Q', with N = 2, or (1, 1) in this case. 
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In analyzing the D = 6 theories, it is useful to write the spectra as massless 
representations of the little group in six dimensions, which is SU(2) @ SU(2). 
If we label the multiplicities of the various states under the little group, we find 
the following group representations for the bosonic states: 

graviton: (3, 3), 
tensor: (3, 1) or (1, 3), 
veetor: (292), 
scalar: (1, 1'): 


(12.4.7) 


If we decompose the various 10-dimensional fields according to this six- 
dimensional classification by the little group, we find 


Sane (373) 4Ce2) + 10 1): 
Boag en (lesy 4) 6( 1, 1), 
A‘: 16(2,2)+ 64(1, 1), 


@: (1,0), 


which yields 80 scalar fields described by (1, 1), 24 vector fields described by 
(2, 2), and one dilaton, as expected. 

But when we compare this with the Type IIA theory compactified on K3, 
we find many similarities. For example, we have the same number of vector 
fields. A,, from the 10-dimensional theory contributes one vector field. The 
Cyuvp tensor contributes 22 vectors. There is also one extra vector because A j.11 
in six-dimensional space is dual to a vector. There are thus | + 1 + 22 = 24 
U(1) vector fields, as expected. 

We also note that K3 preserves half the supersymmetries after compactifi- 
cation, so we have N = (1, 1) instead of (2, 2), which agrees with the previous 
(1, 1) supersymmetry we found for the heterotic string compactified on 74. 
Thus, the fermionic fields of the two theories match. 

However, we encounter some problems when analyzing the moduli space 
and the fields of the two theories. The heterotic and Type ITA theories have 
vastly different fields, with the heterotic theory possessing Es ® Es Yang-Mills 
fields but the Type IJA theory has nothing like this in comparison. We seem 
to have far too few fields in the Type IIA theory. However, this problem is 
resolved once we realize that the moduli space of the Type II theory receives 
contributions from several different sources, including the K; manifold and 
other scalar fields, so that the two theories in six dimensions actually have 
precisely the 80 scalar fields. 

To see this, let us first analyze the moduli space of K3, which is 


SOG, I9) 
SO(3) @ SO(19)’ 


which is 3 x 19 = 57 dimensional plus one additional dimension for the 
volume of the space. Thus, the 10-dimensional metric g,,,, can be decomposed 


(12.4.8) 


(12.4.9) 


R*+@ 
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into a six-dimensional metric plus an additional 58 scalar fields corresponding 
to the moduli of K3. 

To find the moduli space of Type IIA theory compactified on K3, we also 
have to factor in the contribution from other fields. The reason is because we 
also have the additional contribution of the bosonic fields , Buy, Cu, Apup 
in addition to the graviton, each of which may potentially contribute to the 
moduli space. 

To calculate the number of fields that can live on K3, we note that the number 
of massless bosons on K;3 is equal to the number of harmonic p-forms on K3. 
This, in turn, is given by the Betti numbers for the manifold, which are: by = 1, 
by = b= 0, bor 3,b; = 19 panda Sl piewithere areiiiitesiselfidual 
two-forms and 19 anti-self-dual two-forms that can be defined on this space. 
There are thus 3 + 19 = 22 generators. These 22 two-forms living on K3 
transform as scalars under the six-dimensional Lorentz group. In other words, 
the B,,, field reduces to the six-dimensional tensor plus an additional 22 scalar 
fields, while the other fields contribute nothing. 

Thus, altogether, Type ITA string theory compactified on K3 has 58+22 = 80 
scalar fields, the same as in the previous case. 

We find therefore the following number of states: 


fi: GC, 3) Soe, 
Bays (2.1) 4. (see cleen), 
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In fact, the moduli space of Type IIA compactified on K is precisely the 
same as the one for the heterotic string compactified on 7;. Comparing the 
two effective low-energy supergravity theories, we find that the actions are the 
same. 

We find therefore that 


IIA on K3 <> Eg @ Eg on 74. (12.4.11) 


12.4.3 M-Theory in D =7 


Similarly, we can perform the same type of analysis in seven dimensions. We 
can “delete” or “lift” the action of S$; on both sides of the dual relationship 
in six dimensions to obtain yet another dual relationship in seven dimensions. 
This means analyzing the duality between compactifying the membrane on K; 
and the heterotic string on 7; by removing S,. Notice that both have the same 
T-duality group SO(3, 19, Z) and the same moduli space SO(3. 19)/SO(3) @ 
SO(19) @ T, as expected. 
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This can be summarized as: 


IIA on K3 <> E;,® Egon 74, 


M-theory on K; << Es @ Eg on7; (12.4.12) 


Not surprisingly, a very large web of compactifications down to various dimen- 
sions has now been found using membrane/5-brane duality, where the dual of 
a compactified string turns out to be the compactification of another type of 
string. This provides a powerful check on the self-consistency of this approach, 
and yields invaluable insight into the previously mysterious strong coupling 
limit of string theory. 

We caution that we are still far away from analyzing reductions down to D = 
4 with N = 1, whichis the physically relevant compactification. However, new 
duality results have already provided insight into D = 6, N = 1 and D = 4, 
N = 2 compactifications, which should provide a guide for the much more 
difficult case of D = 4, N = 1. 

In summary, a vast number of discrete dualities can be constructed in various 
dimensions. One starting point is to analyze the continuous dualities that were 
laboriously calculated years ago for ordinary supergravity theories, and then 
finding discrete subsets and combinations of them which may persist in a 
superstring theory. 


(2:5) Example: D =4. NV — 2 and D=6, N= | 


The only physically relevant case, of course, is D = 4and N = 1. Because this 
case is difficult to analyze since there are millions of possible vacua, we will 
analyze simpler cases, such as D = 6, N = (1, 0), because it represents the 
heterotic—heterotic duality [4-6], and also D = 4, N = 2. (We will concentrate 
on the latter.) We can obtain the D = 4 theory by compactifying the D = 6 
theory for most cases. We will find already at this level that highly nontrivial 
results emerge, which we expect to carry over into the physically relevant case 
of D=4,N = 1. 

We begin by compactifying the Eg ® Eg heterotic string down to D = 4, 
N = 2, where we need a six-dimensional, Ricci-flat complex manifold with 
SU(2) holonomy. We can show that there is only one choice: K3 @ 7). We will 
show that this theory is dual to the IIA theory compactified on a Calabi—Yau 
manifold with SU(3) holonomy. The analysis is much richer than the previous 
D = 6case, and provides clues as to what we will find in the more complicated 
ciecor p= 4. N= 

We first notice that the heterotic string compactified on K3 @ 7) is compli- 
cated because of the nontrivial curvature of the manifold and the presence of 
the Yang—Mills fields. For example, we know from the Bianchi identity that 


dHS=arPah=Tr RAR. Gl2S:1) 
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If the left-hand side integrates to zero, then we can compute the integrals over 
the right-hand side, because they are equal to the second Chern class c for the 
respective spaces. After integrating the previous equation, we find symbolically 
that 


©)(K3 ® Th) = 24 = c7(Es ® E3). (12352) 


We know that the second Chern class for K ; @ 7 equals 24, so by the Bianchi 
identity, this means that c, for the gauge space Ex; @ E; 1s also equal to 24. But 
the integral over F A F is equal to the instanton number. Thus, we must have 
24 instantons. Because the gauge group is Ex © Ex, we have a choice as to how 
to distribute these 24 instantons. In general, we can have 12 — n instantons in 
one Fx: and 12 + n instantons in the other. Thus, the string vacua that we are 
interested in are indexed by the integer n. 

This theory, in turn, is dual to the IIA theory compactified on some Calabi— 
Yau manifold. But since there are possibly an infinite number of Calabi—Yau 
manifolds, we must restrict our choice. In particular, we wish to construct a 
Calabi-Yau manifold with an integer index n, the counterpart of the integer 
which describes the way the instantons are distributed for the heterotic case. 

We will first make two simplifying observations. We first note that one 
particular class of Calabi-Yau manifolds can be written as K; “fibrations.” 
i.e., K3 fibers defined locally over complex projective space P; as the base 
space. This means that there is a distinct K defined at each point along P). 
We represent this symbolically as K3 over P;. 

Similarly, we also note that one particular class of K; manifolds can. in 
turn, be written as elliptic fibrations. The elliptic fibration can be constructed 
by placing 7> fibers at each point on the complex projective space P) as the 
base space [7]. For this particular class of K3, we write it symbolically as 
K3; = T, over P}. 

This means that this class of Calabi Yau manifolds can be written as 7; 
defined locally over P|, and K; in turn is defined locally over another P,. Intu- 
itively, this means that the Calabi Yau manifold can be written symbolically as 
T, defined locally over a new manifold F, which is equal to P,; defined locally 
over another P;. 

But we also know that the set of P, fibrations over another P; can be written 
as the Hirzebruch space F,,, where n is related to the intersection number of 
the manifold. For example, for n = 0, we have Fy = P, @ P). 

This process can be expressed symbolically as 


K,; over P,; = [7, over P,] over P, 
=f) over (P| overP) ],, 
= Ty/Oven i. (12.5.3) 


Putting everything together, we now suspect that the Es ® Ex heterotic string 
compactified on K 3; ® 7> is dual to the IIA string compactified on a Calabi-Yau 
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manifold given by the elliptic fibration of F,, [8]. So we have 
S: Eg @ Egon K3@T <_ IIA on [7 over F,], (12.5.4) 


where the integer n describes how the instantons are distributed (for the 
heterotic string), or the index of the Hirzebruch space (for the Type ITA theory). 

A more careful analysis of the moduli space, the BPS spectra, and the 
low-energy actions bear out this conjecture. Other interesting features of this 
compactification, which we only briefly mention, are symmetry enhancement 
and tensionless strings. 


12.6 Symmetry Enhancement and Tensionless Strings 


The duality between Type IJA strings compactified on K3 and heterotic strings 
compactified on 7, has many other interesting properties. For example, the 
gauge group of the compactified heterotic string in six dimensions is U(1)*4. 
However, we can easily modify the theory so the gauge group becomes larger. 
With a certain choice of the root vectors, we can enchance the Lie group so 
that it equals a symmetry of the type A—D—E. For example, for the left-moving 
sector of the heterotic string, the root vectors belong to the lattice A7°*, whose 
automorphism group is the familiar T-duality group O(20, 4, Z). If a root 
vector a@ obeys a” = —2, then we can calculate its corresponding mass, and 
we find that it is massless and corresponds to a charged vector particle. Thus, 
when the roots of the lattice are chosen such that they obey this relationship, 
we find more massless vector gauge particles corresponding to an enhanced 
gauge group, given by the classification A-D-E. 

Although gauge enhancement in the heterotic theory compactified on 74 is 
simple, what is not so obvious is how the dual compactified Type IIA string 
exhibits this gauge enhancement when compactified on K3. At first, it seems 
impossible to obtain an enchanced symmetry of the type A-D—E from com- 
pactified Type IIA string theory. Remarkably, a careful analysis shows that the 
Calabi-Yau manifold on which we compactify the Type II string can have orb- 
ifolds which exhibit certain types of singularities. By classifying the possible 
singularities of these Calabi-Yau manifolds, we find that they can be cate- 
gorized according to precisely the same A—D—E classification. So symmetry 
enhancement takes place for both the compactified heterotic string and the Type 
II string, but for entirely different reasons. This is yet another confirmation of 
string duality. 

Another strange property of six-dimensional heterotic strings is that they 
often exhibit phase transitions. Let us analyze the D = 6, N = (1, 0) theory, 
which is chiral. Because of this, we must carefully analyze the chiral anoma- 
lies which may enter into the calculation after compactification. These chiral 
anomalies must be canceled, which places more restrictions on the model. 
A careful calculation shows that the anomaly cancellation is possible if the 
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anomalous eight-form /s factorizes into the product of two four-forms, i.e., if 
Ig — X4 A Xq4, where 


jG aati be) SD SENN 12, 
ioe ak canine. (12.6.1) 


where R is the Riemann tensor, a labels the gauge group, F, is the Yang— 
Mills two-form, and v, and v, are constants given by the theory. It can then 
be shown that the kinetic energy terms for these gauge fields appear in the 
following combination: 


Dae + tye®) Te Fy + Fa), (12.6.2) 


where v, is positive. But notice that the kinetic energy term vanishes if U, is 
negative. This term vanishes if we have the following condition: 


a (12.6.3) 
Va 

for some value of @ = ¢o. 
The vanishing of the kinetic energy term means that the theory is undergoing 
a phase transition. Since the Yang—Mills kinetic energy term is multiplied 
by 1/g?, it means that the coupling constant diverges. This also has another 
interpretation. Before, we saw that there can be dyonic strings present in the 
theory with electric/magnetic charges given by (p.q). The tension of these 

dyonic strings is given by 


ie pe ® + ge®. (12.6.4) 


It can be shown that (p,q) is proportional to (v,. U,). This means that the 
tension of these dyonic strings vanishes at the singularity. These are sometimes 
called “tensionless strings.” 

(For example, for the group SO(71), we have v = | and t = —2. We see 
therefore that the tension vanishes for the (1. —2) dyonie string. For the group 
Ex3, we have v = 7 andv = — ts so we have a tensionless string for the (1. —6) 
dyonic string.) 

It turns out that for a wide variety of compactifications, we find tensionless 
strings. However, there is one compactification in which this singularity never 
appears. For the case of the Ey ® Ex string compactified on K , we find 

Xq=Tr R?—1 Tr F) — 2 Tr Fp, 
ae 2 fy 2 ny 
X= Tr R? + (1 =) Tr FP + (1 - 


where |, 2 label the two Eg groups and the instanton numbers are labeled 


ny = 12—nandnz = 12+n. Inother words, v,; = v» = ‘and v; = (n;/12)—-1. 


)TrF?, (12.65) 


y 


This means that forn,; = n> = 12, we can never have tensionless strings. Thus, 
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if we embed instantons symmetrically into the two E’s, then we have a smooth 
transition between weak and strong coupling, without any phase transition. 

One last feature of six-dimensional string theories is the existence of het- 
erotic/heterotic duality. If we let the area of the torus 7> in K3 @ T> go to 
infinity, we recover the D = 6, N = | theory compactified on just K3. This 
means that the heterotic string compactified on K3 is dual to the limit of IIA 
strings compactified on 7) defined locally over F,, when the area of the torus 
approaches infinity. This later theory can be defined in several ways, which in 
turn gives us IIA theory being dual to yet another heterotic string. Combining 
these two heterotic string theories, we find a new heterotic/heterotic duality in 
D1, 

In summary, we find that several unexpected phenomena arise when probing 
the dualities of the D = 6, N = | theory and D = 4, N = 2 theory. This, 
in turn, indicates that the physically relevant D = 4, N = | theory is sure to 
give us even more surprises. 


12.7 F-Theory 


So far, the unification of the five superstring theories is quite remarkable, but 
there are many loose ends. In particular, it seems odd that M-theory, existing 
in |1-dimensions, cannot be directly reduced down to Type IIB superstring 
theory in 10-dimensions. Instead, we must first compactify M-theory via 7>, 
and then compare this to the Type IIB theory compactified on S,. Since our goal 
is to find a single theory whose vacua yield the various strings and p-branes, 
we would prefer to have a theory which can be compactified down directly to 
the 10-dimensional Type IIB theory without taking a strange limit. 

One intriguing fact is that the Type IIB theory has S-duality given by 
SL(2, Z), which is also the moduli group of the torus. So far, this SL(2, Z) 
symmetry has been treated as a purely mathematical symmetry, without any 
spacetime significance. However, we can also postulate that this SL(2, Z) 
arises from the compactification of a genuine spacetime symmetry on 7}. 
In this case, we postulate the existence of a new 12-dimensional theory, with 
metric (10, 2). This is called F-theory [9]. 

Theories with 12-dimensional symmetry have long been studied in terms of 
supersymmetry. Earlier, we saw that the maximum number of components that 
QO, can have is 32. If we use the usual Lorentzian metric, then | 1-dimensions 
is the maximum number of dimensions with 32 Majorana spinors. However 
if we use the non-Lorentzian space-time group O(10, 2), then we find that 
a Majorana—Weyl! spinor has 32 components, not 64 [10,11]. Thus, F-theory 
may not be such an outlandish idea. 

The essential idea is that F-theory compactified on 7> is identical to Type 
IIB theory, which serves as the origin of the SL(2, Z) modular symmetry. If we 
compactify Type IIB theory on some manifold B, then we can reexpress this 
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in terms of F-theory compactified on a larger manifold A, which has specific 
properties. Recall that the SL(2, Z) transformation operates on the dilaton and 
axion defined at a local point in space-time. Thus, there is a separate SL(2. Z) 
transformation for each point on B, which means that there must be a torus 
defined at each point on B. Specifically, if z is a point on the manifold B, then 
the complex structure of the torus is given by t(<) for each point in B. (Notice 
that a fiber bundle has a topological structure quite different from a product 
manifold T> @ B.) This means that A must have the structure of a fiber bundle, 
not a product manifold. The fiber is 7) and the base manifold is B. so that for 
every point on B we have a separate 7). 

In summary, we say that F-theory compactified on A is identical to Type 
IIB theory compactified on B, if A has the structure of a fiber bundle and is an 
elliptic fibration of the manifold B. We write this symbolically as 


A= Tf, over B. C12.7,1) 


We can also use F-theory to explain the S-duality between the SO(32) theory 
and the Type I theory. Let us compactify F-theory on 7>/Z>. which gives us the 
SO(32) theory. As before, this corresponds to compactityving on a evlinder. so 
we have lost the SL(2, Z) symmetry of the torus. This means that R, L labels 
two inequivalent theories if we take R — 0 or R — x. So SO(32). instead 
of being self-dual as Type IIB theory, is dual to another theory, Type I theory. 

We can also exploit the fact that K; = 7, @ T>/Z> for a certain class of 
K;. If we again “lift” 7 from our previous duality relationship, we have F- 
theory compactified on a certain type of K; that gives us the heterotic theory 
compactified on 7). 

We summarize this symbolically with: 


F-theory on 7) <> IIB, 

F-theory on A <= IIBonB, 

F-theory on 7)/Z. <= SO(32), 

F-theory on K3 - E,S Es onTs. 
for A being an elliptic fibration of B. 

There is no doubt that F-theory has proven to be a useful tool to probe 
the web of dualities. However, there is some doubt as to how fundamental it 
really is. It may indicate that the origin of M-theory really lies in the twelfth 
dimension, or it may be just a convenient mathematical trick. 


12.8 ExafiplesD= 4 


Although D = 4 theories are difficult to construct, we can examine the much 
simpler case of D = 4 and N = &, which is found by ordinary toroidal 
compactification. Although this theory is not realistic, we will study it in order 
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to count BPS states. We know from our previous discussion of superalgebras 
that there should be 56 central charges foran N = 8 theory in four dimensions. 

We start by compactifying 11-dimensional supergravity down to four 
dimensions on 7). é 

Let us analyze the number of gauge fields in four dimensions, and hence the 
number of electric charges appearing in the BPS relation. The metric tensor 
Sua Zives us seven vector fields, while the A,., gives us 7!/5! 2! = 21 vector 
fields. There are thus 28 electric charges in the four-dimensional theory, as 
expected from our earlier discussion of the D = 4 super translation algebra. 
(The remaining 28 magnetic charges comes from the dual theory.) 

Let us compare this with the charges coming from just the supersymmetry 
algebra. Since we compactify on 77, the supersymmetry algebra gives us 7 +7 
electric and magnetic charges. (This comes from the seven charges contained 
within Py as well as the seven charges contained within the five-form Z, nee) 
Similarly, there are 21 electric charges obtained by wrapping the membrane, 
and another 21 magnetic charges obtained by wrapping the 5-brane. (This is 
because the number of space-time scalars we can construct out of two-form 
Zw and five-form Zyyopga in seven dimensions is 7!/2! 5! = 21). There 
are thus 56 electric and magnetic charges altogether, as expected from the 
superalgebra. 

This formulation of | 1-dimensional supergravity compactified down to D = 
4 supergravity, in turn, should be S-dual to Type IIA string theory compactified 
on 7. Let us check this. Naively, we might expect that the moduli space of 
vacua for this string theory is described by O(6, 6)/O(6) © O(6), and indeed 
the 7-duality group is O(6, 6, Z). But this group is too small to describe the 
moduli space of the theory. Thus, we must also analyze the scalar contribution 
from the R-R sector. 

In particular, we are interested in the U-duality group, which we can find by 
looking for groups which contain both the 7-duality group and the S-duality 
group of SL(2, Z). We find that £77)(Z), which is a discrete subgroup of the 
original £77, labeling the moduli space of supergravity vacua, contains both 
the S-duality group SL(2, Z) and the 7-duality group O(6, 6, Z) as subgroups 


i E77)(Z) D SL(2, Z) ® O(6, 6, Z). (12°8:1) 
So we conclude that the U-duality group is £7;7)(Z). A careful analysis of the 
BPS saturated states will confirm this conjecture. 

Let us analyze the vector fields left over after compactification and hence 
the charges they couple to. If we compactify from 10 to 4 dimensions, then 
the metric g,,. and antisymmetric tensor Bya in the Neveu-Schwarz sector 
contribute 6 + 6 = 12 gauge vectors. These couple to 12 electric charges 
transforming as a 12 under the T -duality group of O(6, 6, Z). But at this point, 
we see that perturbation theory fails to account for all the charges of the theory. 
Perturbation theory gives us only 12 electric charges, but we know that they 
must have 28 electric and 28 magnetic charges. 
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The addition of R—R fields only makes the situation worse. The R-R gauge 
fields C,, and A,ap, in principle, contribute 1 + 15 = 16 gauge fields, but the 
problem is that they couple to the string world sheet via the field tensor, rather 
than the potential. Thus, they do not couple to any electric or magnetic charges. 
This means that we are still missing 16 electric and 28 magnetic charges. 

The final resolution of this puzzle comes in when we analyze the full non- 
perturbative theory, which will indeed show that there are new p-brane states, 
called D-branes, which couple to the RR fields in precisely the correct fashion. 

Now we see the resolution via nonperturbative effects. U-duality, which 
is necessarily a nonperturbative symmetry, puts the missing 16 R—R charges 
in the same representation 56 as the 12 charges found earlier in perturbation 
theory. 

We find that the 56 can be decomposed with respect to SL(2, Z) @O(6. 6. Z) 
as 


56 —> (2, 12) @ (1, 32). (12.8.2) 


The essential important point is that the 7-duality separately transforms the 
NS-NS 12 and the R—R 32. There is no mixing between these two sectors. 
But U-duality places all of these states into the same multiplet 56, and hence 
nonperturbatively mixes all these states together. 

With the addition of these R-R charges, we find the BPS relations of the Type 
IJA string are satisfied. The 12 + 12 charges originating in the Neveu-Schwarz 
sector can be constructed in terms of Kaluza—Klein states and winding modes 
of the compactified string. The 16 + 16 charges, which originally were rather 
mysterious, come from the R-R sector from a new type of excitation called 
D-branes. This gives us 28 + 28 electric and magnetic charges, as expected. 

For the heterotic string, the analysis is a bit different. The S-duality group 
is SL(2, Z), the T-duality group is O(6, 22. Z), as expected, but the full U- 
duality group is only SL(2, Z) © O(6, 22. Z). The dimension of the moduli 
space of vacua is thus 22 x 6 = 132. This can also be seen by decomposing 
the fields of the heterotic string to yield the scalar sector. If a. b label the six- 
dimensional torus, then the metric g,,, gives us 21 scalars, the two-form B,, 
gives us 15 scalars, and the 16 gauge fields A‘, for the Cartan subalgebra of 
Eg ® Ex or SO(32) gives us 6 x 16 = 96 scalar fields. Adding these up, we 
find 132 scalar fields, as expected from duality. 

Now let us count the number of U(1) gauge fields and hence the number 
of electric/magnetic charges. The metric g,,., gives us six gauge fields, as does 
the two-form B,,,. The Ex @ Ex gauge field A‘, gives us 16 gauge fields, fora 
total of 28 gauge fields, or S6 electric charges altogether, as expected. 


12.9 Summary 


So far, many of these relationships relicd heavily on the perturbative behavior 
of string theory. To obtain more confidence in the approach. we wish to have 
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a nonperturbative analysis of the theory. This is given by the BPS saturated 
states, which are believed to be nonrenormalized because of the nonrenormal- 
ization theorems of supersymmetry. These BPS states, in turn, are related to 
the existence of p-brane states. 

We begin by analyzing the c-number terms found in the super translation 
algebra 


{Qa, Op} =(T"C)ag Pu + Se eer, Lit) .uttp (12.9.1) 
D 


Historically, the second term was dropped, since it violated the well-known 
no-go theorems of supersymmetry, which made the tacit assumption that only 
point particles existed. Now that we are dropping this tacit assumption, we 
now admit the presence of p-branes which saturate the BPS relationship. 

To understand the dynamics of p-branes, consider a p + 1 rank-tensor 
antisymmetric gauge potential coupled to a p-brane. The coupling is given 


by 
| (1G Wee ROM iaalaar sas (12.9.2) 
where 
Pay= H dtd” (x, — X,,(t)) 0,X"(r), (12.9.3) 


where X ,,(0;) is a generalization of the string variable usually found in string 
theory. 
Now construct the field tensor associated with the p-brane potential: 


ee, = 01, Are ie permutations, (12.9.4) 

We can introduce another field tensor F’ such that the dual of F is identified 
with F’, i.e., 

 Eiiees = ee (12.9.5) 


Since F’ is the field tensor corresponding to yet another tensor potential 
corresponding to a q-brane, we then have the condition 


which gives us the dimension of the dual of a p-brane. 
Now consider the full 11-dimensional super algebra, including the central 
terms 


{Qa, Op} = (ie). Py + ey Zan 
+ ae). ZMNPoOR: (12.9.7) 


Each central charge term on the right-hand side corresponds to a p-brane. 
_Counting states, we find that each term contributes 


11 +55 + 462 = 528 states, (12.9.8) 
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where the 55 states correspond to a membrane, and the 462 states correspond 
to a 5-brane. 

What is rather remarkable about this simple analysis is that we can determine 
the existence of the complete set of BPS p-branes for the theory without ever 
having to construct them! 

For example, consider the 10-dimensional Type IIA algebra: 


102, Op) = (EClerere 2 ey Zope te Gera 
- Genel 6) ZMNPOQ 
+ (TMNPORC)  Zunpor: (12.9.9) 
Counting states, this reduces to 
10+1+104+ 45 + 210 4+ 252 = 528 states. C29 IG) 


By analyzing these equations, we see that there should be BPS states for even 
p-branes. 
If we analyze the 10-dimensional Type IIB algebra, we find 


(OO), =37 (PE"C) Fut (PP Cima 
cee (Cotes Te aL gu ee al ee 
+ (PIMNPORC) , Zunpor: (12.9.11) 
Counting states, this gives us 
10+ (2 x 10) + 120+ 126+ (2 x 126) = 528 states, (12.9.12) 


as expected. 

This, in turn, tells us that there should be odd p-brane states for the Type 
IIB string. 

Of what use are these p-branes? We will find that they play a crucial role 
in defining the nonperturbative structure of the theory. For example, there was 
the long-standing puzzle of what were the sources for the R—R fields. We recall 
that the Type IIA theory had massless R—R fields given by {C,,. Aji.» }. while 
the Type IIB theory had R—R fields given by {/. B’j... Cyenap}. But out of the 
string variable X,,, it was impossible to construct a source for these fields. 
Thus, the string had a net charge under NS—NS fields, but had zero charge 
under the R—-R fields. Now we see that the Type IIA(B) theory actually has 
nonperturbative states given by even (odd) j»-branes which can act as sources 
for the R-R fields. 

This 1s important for the Type IIB theory, for example, because the SL(2. Z) 
symmetry rotates the tensor field B into B’, so there must be new objects which 
carry the charge associated with B’. 

In this chapter, we have seen how the dualities found in higher dimensions 
naturally lead us to dualities in lower dimensions. In particular, we find that 
the nonperturbative region of one compactified string theory often yields yet 
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another apparently unrelated string theory. For example, Type IIA string theory 
compactified on K; is dual to the heterotic string compactified on 7%. 

There are many ways to see how this duality works. The simplest way is 
to compare two seemingly unrelated string theories which have the same low- 
energy structure. Specifically, we shall analyze the supersymmetry generators 
which survive the compactification process. If we begin with a supersymmetry 
generator Q, in 10 or 11 dimensions and then begin to compactify it down 
to lower dimensions, we find that it decomposes into Q, ;, where w labels the 
spinor index in a lower dimension and i labels the number of supersymmetry 
generators. In this way, once we know the holonomy group of the manifold 
on which we are compactifying a theory, we can calculate the number of 
supersymmetry generators N that survives the compactification process. 

In particular, the case D = 6 has been analyzed extensively. We begin with 
the fact that M-theory in | 1-dimensions contains both a membrane and its dual, 
a 5-brane (which is given to us by analyzing the BPS supersymmetry algebra). 
We can compactify both the membrane and 5-brane on a five-dimensional 
space, given by the product of a one-dimensional space M, and a four- 
dimensional space M,. The resulting theories in six dimensions should be 
dual to each other, since they were dual to each other in | 1-dimensions. 

We can let M, = S; or M, = S,/Z». Also, we can let My = K30r Mg = T3. 
In this way, we now have four ways in which to compactify the membrane and 
its dual. In this way, we now obtain four possible dualities. 

Perhaps the most interesting case is D = 6and N = (1, 1), whichestablishes 
a heterotic/Type II duality. Evidence for this duality is given by analyzing the 
low-energy structure of both theories, which are not obviously the same. 

The moduli space of the heterotic string compactified on 7, is given by the 
Narain lattice 

SO(4, 20) 
SO(4) @ SO(20) @ T’ 
which is 4 x 20 = 80 dimensional. This, in turn, must match the number of 
scalar fields of the supergravity theory. To see this, we note that the metric 2,,, 
and B,,,, yield 4 x 4 = 16 scalar fields after compactification. Similarly, the 
Ex ® Ex Yang-Mills field A’, gives us 4 x 16 = 64 scalar fields. The sum 
yields 80 scalar fields, as expected: 


(12.9.13) 


Res 10, 
ae 0, (12.9.14) 
a On 


Also, ve can calculate the gauge group which survives the compactification 
process. Af, yields 16 vector fields. When we combine this with eight vector 
fields oe within g,,, and B,,,, we wind up with 24 U(1) vector fields, as 
expected. 

We can put this altogether by analyzing the little group of the six-dimensional 
manifold, which is given by SU(2) ® SU(2). We find the following on-shell 
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decomposition of the various fields: 


Gane (on 5) MeO), 
Biya GB, 1) FerAl) 4 oe 
Ai 6, 2yeree,.1), 
¢: (1,1), 
which yields 80 scalar fields described by (1, 1), 24 vector fields described by 
(2, 2), and one dilaton, as expected. 
Now compare this with the Type IIA theory compactified on K3. The count- 


ing of scalar states is much trickier, because we must carefully analyze the 
moduli space of K3, given by 


(12.9.15) 


SO(3, 19) 
SO(3) @ SO(19)’ 
which is 3 x 19 = 57 dimensional plus one additional dimension for the 
volume of the space. Thus, the 10-dimensional metric g,,,. can be decomposed 
into a six-dimensional metric plus an additional 58 scalar fields. 

Similarly, A, contributes one vector field. The C,,,. tensor contributes 22 
vectors. There is also one extra vector because A,,,,. in six-dimensional space 
is dual to a vector. There are thus | + | + 22 = 24 U(1) vector fields, as 
expected. 

We find therefore the following number of states: 


Gis ay stein 
Cae (ll), 
Ap 22), (12.9.17) 
Buy 2 (3; 1-2 Bye ec, 1). 
Cun 2a BY. 


+ (12.9.16) 


so there are 80 scalar states described by (1, 1) and 24 vector states described 
by Gay 

Lastly, we observe that K; preserves half the supersymmetries after com- 
pactification, so we have (1, 1) instead of (2, 2), which agrees with the previous 
(1, |) supersymmetry we found for the heterotic string compactified on 7}. 
Thus, the fermionic fields of the two theories match. 

In summary, we find that the low-energy actions of the two theories have 
the same number of fields, the same moduli space, and the nonperturbative 
relationship when we let d > —@. 

This result can be generalized by analyzing the supergravity moduli spaces 
for various dimensions spanned by the scalar fields. We expect that these mod- 
uli spaces for supergravity theories can be generalized to the moduli spaces 
for superstrings by making the duality groups discrete. Fortunately, the moduli 
spaces for | 1-dimensional supergravity compactified down to various dimen- 
sions have been cataloged long ago, and represent the first step in establishing 
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dual relationships between two dissimilar string theories. By comparing the 
moduli space, the supersymmetry group, and BPS states, we can therefore 
establish a number of dual relationships. 

Much more difficult (and more interesting) are the compactifications to 
D=6,N = lor D=4, N = 2, which represent the cutting edge of research. 
Results for these two cases will shed much light on the physically relevant case 
of D = 4, N = 1. Not surprisingly, we find that these compactifications are 
highly nontrivial because of the introduction of Calabi~Yau manifolds. 

Consider the case of D = 4, N = 2. By checking the supersymmetry chart, 
we can see that the heterotic string can yield this symmetry if we compactify 
on a six-dimensional manifold with SU(2) holonomy, of which there is only 
one choice: K3 @ 7. 

Using the Bianchi identity, we can show that 


GH=TrFAF+ Tr RAK. (12.9.18) 


When the left-hand side is zero, and we integrate over the right-hand side, we 
find that the second-Chern class c) defined over the gauge space Eg ® Eg is 
equal to the second-Chern class defined over the manifold K3 @7>, which is 24. 
To satisfy this relationship, we must have 24 instantons for the gauge group. 
We must have 12 — n instantons for one Eg, and another 12 + n instantons for 
the other Ex. 

We suspect that this theory is dual to a Type II theory compactified on a 
Calabi-Yau manifold with SU(3) holonomy, labeled by some index n. Fortu- 
nately, there is a simple way in which to construct such Calabi—Yau manifolds. 
We first make the observation that a certain class of K; manifolds can be writ- 
ten as elliptic fibrations, or, crudely speaking, manifolds with torii 7, defined 
at each point of a base manifold. In this case, the base manifold is given by P}. 

Then we can write the following sequence of relationships between fiber 
bundles 


K3 over P; =[7> over P;] over P, 
= 1, over ||P, over P;],, 
= T> over F,. (12.9.19) 


The last step involves the fiber bundle defined by P; fibers defined over a base 
manifold given by P;, which is given by the Hirzebruch space F,,. Thus, the 
instanton number n defined by compactifying the heterotic string on K3 ® 7) 
becomes the index labeling Hirzebruch spaces F,, when we compactify the 
Type II string on a particular Calabi—-Yau manifold. 

So we have 


S: Ex,@Ez,0n K3@T <= _ IIA on [7 over F,], (12320) 


where we have 12 —n instantons in one Ex sector and 12 + n instantons in the 
other. 
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Many of these results can, in turn, be derived by postulating the existence of 
a 12-dimensional theory which, when compactified, becomes Type IIB string 
theory. We recall that we can define an SL(2, Z) symmetry at each point of 
the manifold, which defines a fiber bundle. We therefore define F-theory as the 
theory when compactified on an elliptic fibration, yields the Type IIB theory. 
We can summarize many of these relationships via 


F-theory on 7) <> JIB, 

F-theory on A <= IBonB, 
F-theory on 7)/Z, < SO(32), 
F-theory on K3 < E, ®@ Eg on7T», 


for A being an elliptic fibration of B. Whether F-theory is a genuine, 
fundamental theory remains to be seen. 


(129-21) 
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CHAPTER 13 


Solitons, D-Branes, 
and Black Holes 


13.1 Solitons 


We have seen that these BPS saturated states are essential to prove the duality 
relationships that we have conjectured. The important point is that it is possible 
to construct all of these p-brane states as solitons, solitonlike solutions (called 
D-branes), or as supersymmetric actions. Solitons are a powerful way in which 
to probe the nonperturbative structure of a theory, so it is not surprising that 
solitons and solitonlike objects will play an important part in filling out the 
BPS states of the theory. Even if we start with a theory which consists purely of 
strings, we are forced to admit the presence of these membrane states because 
they are solutions to the classical equations of motion. 

If we start with D = 11 supergravity, we can construct classical solutions 
which correspond to membranes as follows. We break-up the vector X” = 
(x", y’”), with uw = 0, 1, 2 representing the membrane coordinates, and m = 
Dee lO 

Then the D = 1|1 supergravity equations of motion are satisfied by the 
following metric corresponding to a membrane [1]: 


k 
av {14 
ey. 


where d&2; is the volume form for the 57 sphere, and the four-form field strength 
is proportional to the dual of the volume form on $7. (This solution, it can be 
shown, is actually divergent at the origin, meaning that it probably corresponds 
to a fundamental solution.) 

Similarly, the 5-brane soliton of the D = 11 supergravity theory is given 
by splitting the vector X“ = (x", y”) where uw = 0, 1, 2,3,4,5 andm = 


—2/3 


k 1/3 , 
dx" dx, + (1+ 5) (dy 4 y-d@,) 5 (Ssie1) 
2, 
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6,..., 10. Then the metric tensor is given by [2]: 
k\78 ke \23 
ds’ = (14 ) ax" dx, + (14%) (dy? + y°dQ3) (13.1.2) 
y? 


and the four-form field strength is proportional to the volume form on Sj. 
(Unlike the membrane solution, the 5-brane solution is finite at the origin, so 
it is probably not fundamental.) 

Thus, the p-brane states form an integral part of the web of BPS states. Start- 
ing witha D = 11 supergravity theory, we necessarily introduce the membrane 
and 5-brane solutions (as demanded by the central charges of the algebra) as 
solitons or solitonlike solutions. This is an essential point. The new philoso- 
phy of M-theory is that the various strings are just different vacua of a single 
theory. Perturbation theory just probes the vicinity of each vacua, but cannot 
take us from one vacua to another. However, when nonperturbative effects are 
introduced, then we necessarily find p-branes emerging. Thus, strings in some 
sense have lost their central role in this web of dualities (other than the fact 
that they are the only theories which have a well-defined perturbation theory). 

We will call the p-brane defined in 11 dimensions the M-branes. 

These results, in turn, can be generalized for p-branes in various dimensions 
[3]. Let us start with a generic action often found in string theory defined in 
D-dimensional spacetime, governed by the action 


an D = l —uldie po ) 3 
= = | 4 Bn a(R- (aby a Fas? . wees) 


where F’,, is the usual antisymmetric field strength corresponding to the field 
which couples to the p-brane. 

We wish to couple this to the p-brane action. We introduce the variable 
X,(&), which is now a function of the variables & which parametrizes the 
p-brane world volume 
pol 
— 


ae = ee Jag! a; x™9, XN a et? (p+) 4 Jt 


- 


a GaN i 16 OA PAM... 


where g'’ is the metric on the world volume of the p-brane. The first term 
represents the generalization of the Nambu Goto action for a p-brane, and the 
last term represents the coupling of the massless p + 1 rank antisymmetric 
field to the p-brane variables. 

To solve these coupled equations, we start with the ansatz ¢ = $(\) and 


Any urtpst = ~det (eae nani 
ds? = eAdx# a Bee aye dy", (13.1:5) 
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where 4 = 0, 1, 2,..., p represent the p world volume parameters, and 
m=p+l1,p+2,...,D—1. Similarly, we split the p-brane coordinates as 
follows: (X” = X“, Y™), and X“ = E“, and Y" = constant. 

Then a solution can be given as 


D—p-—3 
= AD» © — ago/2), 
pt+tl 
‘ 2S! ae; 
Wwe p= rhe ago/2), 
a a 
5? = ale — agy/2) + ago/2, 
p= A= 2(p Si p— 3)/(D — 2), (13.1.6) 
and 
ene kyr 15 ee, Pace I 
=e iy aan (13.1.7) 
Cae Kk Pa) ny Dp — sr 0: 


where k = 2«°T/(D — p — 3)Qp_p_2, & is the volume of a hypersphere, and 
do is the vacuum expectation value of @. 
From this, for example, we can calculate the electric charge of the p-brane 


On ee R= xT ae, (13.1.8) 


a I 

2K Sp—p-2 
The point is that even if we originally started with a theory of strings, we 

inevitably are forced to introduce solitonlike p-branes into the theory. For 

dimensions below 10, we find a large number of p-branes, so it is important 

that we systematically calculate their action and their properties. In the process, 

we will find many surprises. 


13.2 Supermembrane Actions 


In addition to constructing the solitons as classical solutions of the equations of 
motion, we can construct the locally supersymmetric actions for these various 
p-branes. 

Let us first count the physical degrees of freedom for the p-brane. Because 
the p-brane action is defined ina p + 1 world volume with reparametrization 
invariance, the coordinate X,, has D — p — | physical degrees of freedom. To 
have a supersymmetric theory, this in turn must equal the number of degrees 
of freedom within the spinor 


D-p-1=%iMN, (13:231) 


where M equals the dimension of the spinor and N is the number of supersym- 
metries in the theory. We have to divide by 4, since, by local kappa invariance, 
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we halve the number of fermion fields, which is halved again when going on- 
shell. This relation is easily satisfied for the string (in 10 dimensions) and the 
membrane (in 11 dimensions) where the right-hand side is equal to 8. (But no- 
tice that it fails for the D-branes and the 5-brane in 11 dimensions. New fields, 
such as vectors and tensors on the world volume, will have to be introduced to 
correct this defect.) 
In particular, we find the following possible solutions [4]: 

p=0: D=2,3,5,9, 

Da le 1D = 3,4, 6,10; 

[haa sie Oi nie ead be 

Dis ge = 67.8 

p=4: Dies; 

jee cae 8 uy |) 8 


(13.208) 


To write the supermembrane action [5], we start with a simple general- 
ization of the original Green—Schwarz action. We begin with a coordinate 
X (01, 02, -.., %p+1) defined on the (p + 1)-dimensional world volume of 
a p-dimensional membrane moving in space. As before, we introduce the 
generalized derivative as 


TI? = aX" — i6T“3,0, (132 3h 


where 6 is a spinor defined in D-dimensional space. Notice that this 
combination is invariant under the global supersymmetry transformation 


ONG Wel 70 nou ——e. (13.2.4) 


Then the first part of the action is given by a simple generalization of the 
Nambu-—Goto action 


S;=—T | d?t'o./—det M1; - 17). (13.2.5) 


Although 5S; is trivially invariant under a global supersymmetry transformation, 
it fails to transform correctly under a local supersymmetry transformation, and 
hence the number of fermionic and bosonic degrees of freedom do not match. 

To correct this problem, we introduce a second contribution to the action, 
which is a Wess—Zumino term. We begin by introducing an invariant p-form 
h defined by 


i aoa 
h= api. OU redo. (13.2.6) 


Notice that this p-form / is invariant under the previous global supersym- 
metry transformation. The point of introducing /: is that we can now introduce 
a (p — 1)-form b, where 


R= ab, (i325) 
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where we demand that dh = 0. Because dM” = i d6T“ dé, we have 


(d6l™ do\(d6ly,..1, 40) = 0. (13.2.8) 
This, in turn, forces us to have 
(Fis P)iag TM Phys = 0, (13.2.9) 


where P is the chirality projection operator, if @ is a chiral spinor. For the case 
p = 1, this yields the well-known constraint that D = 3, 4, 6, 10, which gives 
us the Green—Schwarz string. This new identity, however, forces us to obey a 
new constraint, which is given by contracting the identity with ("’)**. After a 
bit of work, we find once again that D — p — 1 = MN/4, as before. 

Then the Wess—Zumino action is given by 


ps8 a 
S> = -or f *b = ————____ if Fela 8 alan CFT an (13-2710) 
Ge : 


If we write this out in detail for the supermembrane, we find for the Wess— 
Zumino term 


iT re : 
52 = ~> fi do {(e*6T 8:9) [TT + i461’ d,0 
— (3)(61F*4,6)(6Td,.4)]} . (13.2.1) 


Now let us check that this action is locally supersymmetric. Let 66 be 
undetermined at this point. Then we find 


1 = 
On — al seo RIG LO pee 0e- (13.2. 12) 
p! 
For the case p = 2, we find: 


8S = 2iT il odd (/=e8"Ti = sets) 8,0 


=iT | dos — c)./—gg''T;0,6, (1372; 13) 
where 
u y Up i Wee ee 
P=, Py = NNT, see eae 
(13.2.14) 


We see therefore that 6§ = 0 if we choose 
@ = sell. oe = Cl se IM ys), (e215) 


where 4(1 +I’) is a projection operator. 

The important point is that the introduction of a Wess—Zumino term into the 
Nambu—Goto action has rendered the action locally supersymmetric, such that 
we can eliminate half the components of the 32-component spinor. (We can 
eliminate another half of the spinorial components by going on-shell, leaving 
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us with the desired eight components to match the eight components of the 
bosonic theory.) 


13.3. Five-Brane Action 


By counting states, we find that the 5-brane does not satisfy the usual counting 
of p-branes. To construct the 5-brane action, we must introduce a new second- 
rank tensor which is described by a self-dual field tensor F;,,n/. 

Let us count the number of physical modes 


(D-p-+(? 7°) =3mn, (13.3.1) 


where the first two terms represent the string and two-form degrees of freedom, 
respectively. The solution for this is p = 5, giving us a 5-brane. 

This poses new problems, however, since in general a covariant action for 
a propagating self-dual field is extremely difficult to construct. Various no- 
theorems exist, in fact, which actually show that it is impossible to write a 
covariant action for a propagating self-dual field under certain conditions. 
Solutions to this long-standing problem have been proposed, but which involve 
an infinite number of auxiliary fields. Recently, however, a convincing action 
for the 5-brane has been written involving just a single auxiliary field [6-7]. 

We start with a tensor field on the world volume A,,,,,. Its field tensor is given 
by Frnt = 2 (0;Amn + On Ant + On Aim) « 

Then its dual F*”"” is defined as 

*lmn | 
i 5 = 

We will also find it crucial to introduce a scalar field a (which will be 

eliminated via gauge fixing) and the tensor 


. l 
* dla. (13.3.3) 


Finn SS = Fae 
V (ma)? 


The bosonic part of the 5-brane action can be written as L = L, + Lo, with 


imnpqr BE 
pqr- 


(1883.2) 


Ly = ,/ —det (mn +iFnn), 
Dae 
Ly = -——— F*" Fda’. whe 
2 ap 1pda (13.3.4) 


This action is invariant under the usual variation of an antisymmetric tensor 
(for small parameter @,, ): 


Ans = 5 Oma On); 
6a = 0. (13.3.5) 
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This is not surprising. But what is unusual is that it is also invariant under a 
variation which allows us to gauge the a field entirely away 


aoe 
2(aay? 
ba = 6, (13.3.6) 


6Amn-= (Fpnpoea a Vien) ’ 


where 


[_(aq)2 5,/ —de i F 
Vien = 2 eat aeeaet lee (1327) 
8g 6 Finn 

This last variation is the key to establishing a covariant action. Although the 
field a was introduced to maintain covariance, eventually we wish to eliminate 
it among the physical states of the theory. 

To check that this theory does in fact yield a self-dual tensor field, we can 
power expand the action in flat space, in which we obtain 


L=ahiee  — O° a(F — Fan F — F*y"6,a, (13.3.8) 


8(da)* 
where we derive the constraint 
ant ee 0 (13:39) 


as desired. We see that the tensor F,,,,; is self-dual. 

Similarly, we find that the above action can be made supersymmetric with 
the addition of spinor fields 6. 

(We have now found 5-brane solutions in two different ways: via soliton 
solutions of the low-energy action and also via a supersymmetric action. There 
is some confusion, however, over which extended objects are“fundamental” 
and which ones are“‘solitonic,’ which differ by whether they are divergent at the 
origin. Under a duality transformation, in fact, we can turn one into the other. 
For example, we found earlier that the counting of BPS states requires both 
KK states and winding modes, the latter which are considered to be solitonic. 
But 7 -duality converts winding modes into KK modes, so what is fundamental 
and what-is solitonic is a matter of taste. However, one classification that is 
widely used is to consider the tensions for the various p-branes. The tension 
for the fundamental string is of order 1, the tension for the D-branes is of order 
1/g,, and the tension for the D = 11, solitonic 5-brane is of order ie) 


13.4 D-Branes 


Lastly, we will explore in more detail the D-brane [8, 9]. There are important 
reasons for introducing this new kind of object, which shed much light on the 
nonperturbative nature of string theory. 
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First, we notice that we need a new type of object to couple to the R-R 
background fields. So far, the strings that we have analyzed only act as sources 
to the NS-NS background fields, not the R-R fields. The R—R fields do not 
couple to the string world sheet. In fact, to any finite order in perturbation 
theory, the string has no charge under the R-R background fields. Thus, we 
need a new nonperturbative object, called D-branes, which can act as sources 
for the R-R background fields. 

We recall that the tensor field B,,,, background field arising from the NS—NS 
sector couples directly to Type II strings, so that Type II strings have a charge 
under this tensor field. However, the R-R fields couple to the string only via 
their field tensors F,,,_,,,. So the NS-NS and R-R background fields couple 
to the string via 


ELGG RE ct, SO uae 


where S, are the standard spin fields defined on the string world sheet. Thus, 
the string has no charge under the R—R fields. This poses problems. because 
duality will in general mix the two sets of charges. For example. we saw earlier 
that the Type IIB string has two massless tensor fields B and B’, such that the 
duality group SL(2, Z) turns one into another. But since the Type IIB string 
carries no charge under B’, it means that we must introduce a new object, 
the D-brane, to carry this charge. For example, if (1. 0) represents the usual 
Type IIB string which has charge one under the NS—NS field but zero charge 
under the R—-R field, then an SL(2. Z) rotation will in general transform the 
(1, 0) string into a (7m, n) string, which can never be seen to any finite order in 
perturbation theory. 

Second, there is a surprisingly large family of such D-branes in lower di- 
mensions. If we analyze the super translation algebra in 11 dimensions. we 
recall that we only have p = 2 and p = 5 M-branes. But if we compactify the 
super translation algebra, then the p = 2 and p = 5 sources in 11 dimensions 
decompose into a very large family of p-branes in lower dimensions. many 
of which correspond to D-branes. In general. we will have even dimensional 
D-branes for the Type IIA string, and odd dimensional D-branes for the Type 
IIB string. 

We noticed earlier that there was a simple relationship which determined 
the dimension of a super p-brane. However, the D-branes resulting from com- 
pactifying M-branes do not, in general, obey this relationship. Thus, we have 
to introduce a new vector field in addition to the p-brane field in order to 
generalize the brane-scan. 

A vector field on the (7 + 1)-dimensional world volume contributes another 
p—I| degrees of freedom to the physical states. So we must modify our previous 
formula D — p — | = {MN to the following 


D-brane: D-—2=3iMN. (13.4.2) 


Notice that the p has dropped out of the calculation. Comparing this with 
the R -R states generated by Type IA, IIB, and I strings, we find that D-branes 
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can match them perfectly. This allows us to complete the counting of BPS 
states, which is a powerful check on the self-consistency of our nonperturbative 
analysis of duality. 

Originally, D-branes were discovered by analyzing T-duality in Type I 
theory. Before, when we made the standard 7-duality transformation, closed 
strings were mapped into closed strings. We found that 


Xu(Z) + Xu(Z) > XZ) — XuZ) (13.4.3) 


under a 7 -duality transformation. So T-duality simply linked different closed 
string vacua. However, if we try to perform a T -duality transformation on Type 
I open strings, we find that the boundary conditions interfere with the analysis. 
Specifically, the Neumann boundary condition at the end of an open string, 
after a duality transformation, turns into a Dirichlet boundary condition. 

Let us compactify the jth direction. The standard open string expansion of 
X,, transforms into X » as follows: 


X, =X, — ia’ p, Inzz (13.4.4) 
4S > ( our (13.4.5) 
2-H 
X, = &, — 10% In (=) (13.4.6) 
tile Sm (zm zm) (13.4.7) 
neo 


In other words, the Neumann boundary condition for X ,, becomes a Dirichlet 
boundary condition for X,,: 


ep 0 == Ox =—0 (13.4.8) 


or X 4 = 0 at the endpoints, where x is the dual string. 

The dual to an open string therefore is an open string such that the endpoint is 
now fixed. For the 26-dimensional bosonic string, this means that the endpoints 
of the dual open string lie on a fixed (24 + 1)-dimensional hyperplane. But 
since string theory is a theory of quantum gravity, the dynamics of the theory 
will eventually make this hyperplane move with time, we therefore are forced 
to introduce a new object: a new type of p-brane on which strings can end, the 
Dirichlet-brane or D-brane. 

Let X”5 represent the 7-dual string with Dirichlet boundary conditions on 
the twenty-fifth coordinate. If we integrate 


[eo 0-25 = X25 (r) — X75(0) = 200" p® = 2na'n/R = 2nnR, 
(13.4.9) 


where we have used R = a’/R. But since we have compactified the twenty- 
fifth direction for the dual theory, it means that both ends of the string must 
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lie on the same D-brane. This can be generalized for the arbitrary case. If k 
out of D total dimensions are compactified, then 7 -duality leads to a Dirichlet 
p-brane with p= D—1-—k. 

Notice that T-duality converts a Neumann boundary condition into a Dirich- 
let condition. This, in turn, can explain how 7 -duality changes the properties of 
D-branes in Type IIA and IIB theories. We saw earlier that Type I[A(B) theory 
has even (odd) Dirichlet p-branes. If we make a T -duality transformation, then 
even and odd p-branes must turn into each other. The way this happens is as 
follows. If we start with the Type IIA theory and compactify one dimension 
and make a 7-duality transformation, and if a D-brane is wrapped around the 
circle, then one of the Neumann boundary conditions of an open string turns 
into a Dirichlet boundary condition, so the world volume dimension of the 
D-brane decreases by one. In this way, we can go back and forth between even 
and odd D-branes via T -duality. 

Now consider the case where we have open strings with Chan—Paton factors. 
For example, we recall that the Type I string incorporates O(32) symmetry 
because the open strings have isospin matrices attached at each end. Vertex 
functions appear with an explicit Lie algebra matrix A, in them, which gives 
rise to a trace over these matrices TrA“A?A‘°A“ for scattering amplitudes. We 
will find that the usual Kaluza—Klein formula for the momenta, p = n/R, is 
significantly altered by the presence of gauge fields. 

Consider an open string moving in the presence of a massless field, in this 
case the U(N) Yang—Mills field. Choose a classical configuration of the Yang— 
Mills field equal to a pure gauge field A,, = —iU~'0,,U, such that 


U = diag(e! O27 | giXP@w/2nRy (13.4.10) 


where U is a diagonal N x N matrix but takes different values e'* “ 27% 
along the diagonal. The presence of this classical configuration will break 
U(N) symmetry down to U(1)%. 

Notice that U is not periodic under the transformation X°* = X°*5 + 27 R. 
In fact, we pick up an additional phase factor for U 


U — diag (e ...e!) U. (13.4.11) 


Now consider the effect of this phase change on a vector vertex of the string, 
which is labeled by a Chan—Paton factor |i), where these indices are in the 
fundamental representation of U(N),soi = 1,2..... N. The ith element will 
pick up a phase e’, the jth element picks up the complex conjugate e~'', so 
the vertex function picks up a phase, 


lif) —> ef 177), (13.4.12) 


The vector state |i) also has momentum p~* associated with it, which also 
picks up a phase factor under X°° -+ X*5 + 20R. Normally, a state with 
momentum p picks up a phase factor e‘” under the shift.» — x +. We find, 
therefore, that the additional phase picked up by the gauge field yields an extra 
6; /(27 R) to the momentum. 
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The possible momentum for the vertex function is now equal to 
p>? = (2xn + 6; — 6;) /(27R). (13.4.13) 


In order to determine where the open strings end, take the integral 
f dod, X*°. We now find 


Xr) — X°5(0) = (2an +6; —6,)R (13.4.14) 


so now the open string no longer has to begin and end on the same D-brane. We 
now find open strings ending and beginning on N distinct parallel D-branes, 
located at positions 6; R, with a network of open strings connecting them. The 
gauge group associated with this is not U(N), but U(1)”. 

However, it is possible to restore U() invariance. Now let the D-branes 
gradually merge into one D-brane. Then we have 6; — 0;, and the full U(N) 
symmetry is restored. The vector state |i) no longer picks up a phase factor 
from the isospin transformation. 

This restoration of U(N) can also be seen by analyzing the spectrum of the 
string. Normally, the spectrum of the open string is given by 


M? = (p>) + —(N - 1) (13.4.15) 


A (27n +6; —6;)R 
27a! 


2 
+—N- 0) (13.4.16) 


where NV is the number operator of the harmonic oscillators. If we analyze the 
lowest lying states for V = 1 and n = 0, we find that new massless vector 
states emerge as 6; —> 6;, which are required to complete the spectrum of the 
full U(V) gauge field. 

Thus, when the hyperplanes are separated, we have N massless vector fields, 
one for each of the hyperplanes. But as the hyperplanes converge, the number 
of massless vectors rises to N’, giving us U(N) symmetry. The lesson here 
is that the gauge group associated with N parallel and distinct D-branes is 
U(1)", which becomes U(N) as the D-branes become coincident. This will 
prove crucial when we discuss bound states of N Dirichlet 0-branes. 


13.5  D-Brane Actions 


The action for D-branes can also be constructed in several ways. 

First, we might repeat the same steps used in ordinary closed string theory 
to find the effective low-energy action [9]. Since we are working with an open 
string, the massless sector is given by a vector and a spinor. The coupling of 
the string to the background vector field is given by the following boundary 
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term: 
P 
S= fas Oe ee 
m=0 
25 - 
4 / ds) AVG", ane (13.5.1) 
i=p+l 

where the massless vector field only depends on x°,..., x”, the coordinates 


describing the p-brane. We compactify the coordinates labeled by 7. One im- 
portant point is that A; describes the fluctuations transverse to the brane. 
Because of this, the p-brane is dynamical, rather than being a fixed hyper- 
plane. As before, we integrate out the higher string modes, leaving us with the 
beta function relation 


et. (13.5.2) 


Since the resulting theory must be conformally invariant, we can now treat 
these beta function equations as the equations of motion of the massless fields. 
The action for the D-brane is then chosen so that the 6 = 0 equations arise as 
equations of motion. 

By explicitly performing all these steps, we find the D-brane action given 
by the Dirac—Born-Infeld action 


d?*'q,/—det(G;; + F;; — Bij), (13.573) 


where f” represents the value of X* at the boundary of the p-brane for the 
Dirichlet conditions, and where G and B are the usual terms formed from the 
pull-back to the membrane surface, i.e., Gj; = 0; f¥0; f” Quy. 

There is yet another way to construct the action which is more direct: take 
the p = 2 M-brane action of M-theory and dimensionally reduce it down 
one dimension [10, 11]. The resulting 10-dimensional theory should contain 
both the usual Type IA string as well as the p = 2 membrane, which is the 
D-brane. Thus, a Kaluza—Klein mode arising trom the compactification of 11- 
dimensional super membranes becomes a Dirichlet 2-brane in 10 dimensions. 

We recall that the supermembrane action in |] dimensions in flat space 
contained the term g TI" T1;.4;, where M is an 1 1-dimensional index. If we 
separate out the eleventh dimension, and define @ = X,,. then we get 


gu Ty > eT + ig (0:6 — 167 119:0) (8; — i61119;) . 
(13.5.4) 
Now we wish to replace the @ field everywhere with a vector field L,, such that 
d;@ = L;. This is accomplished by adding to the action a Lagrange multiplier 
A 


j 


eUF Aa; Ly. (135) 
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If we eliminate the A; field from the action, then it enforces the constraint 
0;L; — 0;L; = 0, which is solved by setting L; = 4;@, so we can replace 0;¢ 
with L; everywhere. 

So far, we have done nothing, since by eliminating A; we retrieve the original 
action. Now, let us take a different path and eliminate L; instead. Then the 
relevant terms in the action are 


eK AO, Ly + gi LjiLj ++. (13.5.6) 
Eliminating L; from the action yields the U(1) Maxwell term 


Putting everything together, we find 
l 7 ey ae 
S= =; f Po/=ate!'T, + Seo! FF —1] (13.5.8) 
1 . A 
sae / Boel* (bij, — 310711 0;:0 Fix], (13.5.9) 


where 


Fy = Fiy — by (13.5.10) 
and b;;, and b;; are complicated functions of I] and 9, arising from the original 
b;;, found in the membrane action, in which the 0; X;; components have been 
explicitly removed. 

In summary, we find that the D-brane action is the usual Nambu—Goto action 
with an additional U(1) Maxwell field tensor. It has the structure of a Born— 
Infeld action. 

The previous calculation was done in flat space with all the fermionic fields 
present. More interesting is the case when the D-brane moves in the presence 
of background fields. The same calculation can be done with background fields 
(setting the fermionic fields to zero). Then we obtain, for the bosonic action 


d?*\o,/—det(G;; + Fi; — Bij), (135.11) 


where G and B are the usual terms formed from the pull-back to the membrane 
surface, i.e.,G;; = 0;X"0; X" g,,,. (Before, we found that this term in the action 
was a function of the Dirichlet boundary term f“ in the effective field theory 
formalism. We see that this has been transformed into the p-brane coordinate 
X* in this formalism.) 

The fact that F;; — Bj; occurs in this particular combination can be seen 
from the fact that the gauge transformation of the B field, which usually can- 
cels to zero, now picks up a boundary term, which must be canceled by a 
corresponding change in the vector field 


Bi; —> Bij + 0:Xj — 9; Xi, 
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The most immediate application of D-branes is to analyze the proof that M- 
theory compactified on a circle is equivalent to Type IIA string theory. Crucial 
to the proof was the statement that the Kaluza—Klein modes resulting from 
compactified M-theory correspond to BPS saturated modes in 10 dimensions. 
We saw earlier that the momentum of the eleventh dimension is quantized 
according to 


N 
Pee (13.5213) 
The N = 0 term corresponds to the ordinary string, the N = | term corre- 
sponds to a single Dirichlet 0-brane, and the higher N modes correspond to 
bound states of Dirichlet 0-branes. 

Now consider the case of N parallel and flat D-branes, with an open string 
attached between each pair. Each end of the open string has a U(1) gauge field 
associated with it. Now let the N parallel D-branes gradually merge into a 
single hyperplane. Then we find that additional massless vector boson states 
emerge, which yield a gauge theory based on U(N) [12]. 

Now re-do the calculation of the D-brane action using the 6 = 0 condition. 
If the branes are widely separated, then we arrive at the usual U(1) Dirac~Born— 
Infeld action. But now perform the same 8 = 0 calculation, letting the D-branes 
slowly coincide. We then find that coincident D-branes are described by 10- 
dimensional U(N) super-Yang—Mills theory reduced down to the (p+ 1 )-space 
of the D-brane 


S= 7 f aot FF ™ Bc 10) 0 eee (13.5.14) 


(In general, there can be other terms involving higher powers of the field tensor 
F and its derivatives.) Notice that the fields are only functions of p+1 variables, 
not the entire 10-dimensional space. This means that many of the terms drop 
out 


Fiat On An Ona lam. Onl (13.595) 
Finj = OnX; FifAm, X;], (13.5. 16) 
Fi; = i[X;, Xj]. (1iasdel'7) 
where i, j represent the modesi = p+1.p+2..... 9, and the p-brane modes 


are represented by mi —=On 1.2) 57. p. 

Consider the case, p = 0, representing a point particle. For the Dirichlet 
Q-brane, this means that all the membrane coordinates have disappeared except 
for one: 


Fig = i[X', X/], 

Bg Xle Oy NX! &iTAG, KX"), 
i[X;. 6]. 

Doo = a6 aan 


cs 
| 


(13.5. 13) 
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So the action reduces to 


Ss rf arts (4(Dox'? = 10" DEG PE (LX' X7]) LOT y 1x), 61) 
(1375219) 
The remarkable feature of this U(N) action is that we can now begin to 
describe multi-Dirichlet p-brane scattering amplitudes. Notice that if the ma- 
trices for the super- Yang-Mills theory are block-diagonal, with many U(N,) 
matrices appearing along the diagonal, then the theory trivially breaks up into 
a series of noninteracting U(N;) D-brane theories. The off-diagonal parts of 
the group matrices therefore correspond to the interactions between D-branes. 
It seems rather strange that the same super-Yang—Mills action can, by itself, 
describe the interactions of several D-branes, not all of them coincident. 


13.6 M(atrix) Models and Membranes 


One of the most important applications of D-brane physics is to elucidate the 
nature of M-theory. We say earlier how complicated M-theory was, with 2- 
branes and S-branes. But one of the most remarkable conjectures about the 
nature of M-theory is that it reduces to a simple matrix model expressed in 
terms of Dirichlet 0-branes, in a certain limit. 

We begin with the observation made earlier that M-theory compactified on 
a circle is S-dual to Type IJA string theory. The supersymmetric translation 
algebra in | 1 dimensions contains the term py, which decomposes into p,, and 
Z after compactification on a circle. Notice that Z corresponds to the charge of 
a Type IIA Dirichlet 0-brane, given by the Kaluza—Klein quantization condition 

N 
ja R (13.6.1) 
for integer N. As we saw earlier, the Kaluza—Klein states of the compactified 
1 1-dimensional M-theory are equivalent to the nonperturbative 10-dimensional 
Dirichlet 0-brane states. 

Now take the infinite momentum limit, i.e., let Z = pi; — oo. In this 
case, the Dirichlet 0-brane term dominates over all other terms in the super 
translation algebra, so that the theory reduces to that of a Dirichlet 0-brane. This 
infinite momentum limit, in turn, can be realized by taking the limit N > oo 
and Pi) see OO 

Notice that a single Dirichlet 0-brane has momentum given by | /R, and that 
a bound state of N Dirichlet 0-branes (which coincide) has momentum N/R. 
Furthermore, we can show that this bound state of N Dirichlet 0-branes is 
described by supersymmetric quantum mechanics (not quantum field theory). 

The string state corresponds to N = 0. This is because strings cannot be the 
source for the R-R fields, which are presented by the central charge Z. 

Thus, we are led to the following conjecture {13, 14]: 
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Conjecture. The infinite momentum limit of M-theory is equivalent to the 
N -> oolimit of N coincident Dirichlet 0-branes, given by U(N ) super-Yang— 
Mills theory. 


Since the full action of M-theory is totally unknown, it seems almost mirac- 
ulous that such a complex theory can be represented (in this peculiar limit) 
by such a simple theory: large—N point particle super- Yang-Mills theory! But 
the fact that physics simplifies in the infinite momentum frame is one of the 
reasons why it was introduced years ago. Large classes of Feynman diagrams 
vanish as 1/p); as pi; > ©. 

For example, Feynman diagrams corresponding to particles which are cre- 
ated out of the vacuum are suppressed in the infinite momentum limit. This 
means that the vacuum is trivial in this limit. Furthermore, the Feynman 
diagrams in the infinite momentum limit resemble the “old-fashioned” per- 
turbation diagrams found in nonrelativistic field theory. For our purposes, 
we exploit the fact that Feynman diagrams involving everything except the 
Dirichlet 0-branes are suppressed as | /, in the infinite momentum limit as 
P11 — oo. In particular, the string, because it is not charged under the R-R 
field, has pj; = 0 = N and is suppressed in the infinite momentum limit. 

To see this, let Py label the momenta ofa collection of particles. Let the sum 
of these vectors equal P. Then each momenta p, can be decomposed as 


Da =NaP + Pra (13.6.2) 
such that 
Musil. Yo ba. ieee (13.6.3) 


Now let us boost the system of particles such that P — ©. For sufficiently 
large P,, we see that all 7, are positive. 
The point of taking this large P limit is that we can write the energy as 


| — Noe ge 


zs er ee ‘ 
=e i +O(P~). (13.6.4) 
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Let us now analyze the Feynman diagrams which appear in the theory in the 
infinite momentum limit. We see that the covariant propagators start to resem- 
ble nonrelativistic energy denominators found in nonrelativistic perturbation 
theory. Let us focus on the differences in energy. 

If two particles have positive and equal 7,,. we see that their energy dif- 
ference goes to zero as 1/P. But now analyze what happens to the energy 
difference between a particle with positive 7, and another with negative 7, 
(which occurs because we must integrate over all momenta in field theory). 
The energy difference grows as P. Thus, this energy denominator damps as 
1/P + 0, so that particles with 1, < 0 decouple from the theory. Thus, our 
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conclusions is that states with negative or vanishing n, have energy denom- 
inators which make them decouple from the theory, leaving only the states 
with positive 7,. But in matrix models, the only states with positive 7, are 
those states with momentum p,; = N/R, where N > 0. Thus, M-theory in 
the infinite momentum limit reduces to a theory of particles with positive n,, 
i.€., States with positive p,,;, which are the Dirichlet 0-branes. These states, in 
turn, are described by super-Yang—Mills theory defined on the world volume 
of the D-brane. The states with vanishing or negative 7, are the string states 
and anti-Dirichlet branes, which decouple from the theory because of infinite 
energy denominators. 

One nontrivial aspect of this limit is that although we can simply drop the 
states with strings and anti-D-branes because they don’t have positive n,, the 
theory “knows” that these particles must exist in M-theory. 

This identification is made all the more remarkable since super- Yang—Mills 
theory is defined only in 10 dimensions, yet the N — oo limit of this theory 
yields a fully 1 1-dimensional theory (M-theory). Somehow the N —> oo limit 
makes possible the emergence of the eleventh dimension. 

We see evidence of this simple yet nontrivial conjecture because the Yang— 
Mills field has 16 fermionic fields. When we take composites of this, we find 
2% = 256 states, which is precisely the number of states found in supergravity 
theory. 

We also find convincing proof of this conjecture when we investigate the 
scattering of Dirichlet 0-branes, and find they exactly reproduce the scattering 
of | 1-dimensional supergravitons [15—18], which is a highly nontrivial result. 

For example, let the scattering take place in the eighth and ninth spatial 
directions, where eight represents the horizontal axis and nine the vertical 
axis. Before impact, let’s say the two particles are coming at each other along 
two parallel, horizontal paths along the eighth axis. They: are moving toward 
each other with relative velocity v. But their trajectories are off-center by 
a vertical distance, the impact parameter 0, in the ninth direction. Then the 
distance between these two Dirichlet 0-branes in the 8—9 plane is given by 
J(vt)? + b?. 

In isospin space, X isa2 x2 matrix, such that the diagonal elements represent 
the location of the two Dirichlet 0-branes. Let X' = B' + \/gY', where B' 
labels thé location of the two classical Dirichlet 0-branes, and Y' represents 
the quantum fluctuations around the two Dirichlet 0-branes. Let’s say that the 
diagonal element B,; represents one Dirichlet 0-brane, and By. represents the 
other. Then we can say that 


a ion es Fe 
, =| ote 1) 9 ep) 
(13.6.5) 


and 03 represents the Pauli matrix, since the coordinates are 2 x 2 matrices for 
the group U(1) x U(1). The other components of B' are set equal to zero. 
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Now power expand the super-Yang—Mills action around the classical 
configuration B. We find the following action written in terms of Y: 


Sy =i ‘| dt ($Y{(82 — PY} + 9¥2(8r — Wa + 3 YOY 
— f/ge®* ec piyiyiys — 5 tht gots ViYJ¥.Y}). (13.6.6) 


Since B; represents the classical position of the string, it only contributes to 
the mass term of the action expressed in the quantum variable Y'. Therefore 
the first line of this equation give us the kinetic and mass terms for Y'. 

If A is the zeroth component of the gauge field, then we can similarly write 
the action for this field 


Sa =i f ac(save: — r?)A; + 4A2(8? — r?)Az + $4307 A3 


+ 26730, ByAaY), + /ge”a, YI ALY: 


— Jfge?* cP BiA,A,Y! — ete meant). (13.6.7) 


By explicitly inserting the classical value of B' into these two equations, 
we can now read-off the free-field part of the action in terms of bosonic fields 
with mass given by Y' and A. There are three isospin components to each 
of the gauge fields Y' and A, and there are 10 such gauge fields, so we have 
30 bosonic fields altogether. By diagonalizing the mass matrix, we find the 
following arrangement of the boson and fermion fields: 


16 bosons with masses r?; 

2 complex ghost bosons with masses r?’; 

2 bosons with masses r* + 2v; 

2 bosons with masses r? — 2v; 

10 massless bosons; 

8 fermions with masses r? + v; 

e 8 fermions with masses r? — v; 

e 2 complex ghost bosons with masses m7; and 
e | massless complex ghost boson. 


If we functionally integrate out over a boson field with mass r* + a, then 
the following determinant emerges in the path integral: 


D, = det(—d, +r? +a). (13.6.8) 


By including all these massive fields, we find that the functional integral is 
modified by the determinant 


Die DF DE Den)... (13.6.9) 
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We can lift each D into the action of the path integral by D~! = e~'82, 
Then the effective action gets modified by 

S — S — log Dit. (13.6.10) 

We can define the potential associated with the scattering process by 
log Dex = f drVen(b? + v°r?) (13.6.11) 

Putting everything together, after a bit of algebra we find 
15v4 Th 
— a 6. 

ff er? +o(=) (13.6.12) 


This in turn, is precisely the number obtained from ordinary supergravity 
scattering, which gives us confidence that this formalism is correct. In fact, this 
calculation has actually been done to two-loop levels, with prefect agreement 
with the supergravity result [18]. 

For general (p — p)-brane scattering, we have 


4 6 
Ven = 5, + 0( = i) (13.6.13) 
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This formula has several interesting consequences. Notice that the potential 
is a function of velocity, and vanishes for zero velocity. This reproduces the 
“no-force” condition between two static D-branes, which is often found in the 
theory of solitons. For example, if we take two parallel and static hyperplanes 
representing two D-branes and calculate the interaction between them, we can 
sum over the fluctuations of an open superstring which connects the two hy- 
perplanes. Because of supersymmetry, the bosonic and fermionic string modes 
cancel each other out, and we find that there is no force between the two static 
hyperplane D-branes. 

More important, we find that the effective distance which a D-brane can 
probe is proportional to the velocity, which means that D-branes may be able 
to probe much smaller distances than the traditional string. It is usually thought 
that the conventional string cannot probe any smaller than the Planck length. 
Because of 7 -duality, we can argue that a new“uncertainty principle” is emerg- 
ing, preventing us from using strings to investigate distances smaller than the 
Planck length. This has led some to state that there is an ultimate distance, the 
Planck length, below which physics cannot probe. 

The velocity dependence of the D-brane scattering potential, however, seems 
to indicate that at low velocities, we may be able to probe distances much 
smaller than the Planck length. For example, from the scattering potential given 
above, we can extract out the impact parameter b of the scattering process, 
which turns out to be on the order of \/v. So the effective distance probed by 
the D-brane goes as the square root of the velocity. Whether this suggests that 
D-branes have a deeper physical significance remains to be seen. 
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Lastly, we also see evidence of the M(atrix) model conjecture in the fact that 
11-dimensional membranes which appear in M-theory can also be viewed as 
collections of Direchlet 0-branes. This is a rather remarkable result, since the 
lack of a suitable quantization scheme for the | 1-dimensional membrane was 
one of the many reasons why it was abandoned in the 1980s. Specifically, the 
membrane action (even at the free level) is highly nonlinear and difficult to 
quantize. Worse, the Hamiltonian does not admit stable states [19], and the ac- 
tion is also ultraviolet nonrenormalizable. For all these reasons, | 1-dimensional 
membrane theory was considered an oddity, rather than a fundamental result. 

The new interpretation of the 11-dimensional membrane theory is that it 
arises in the large N limit of Direchlet 0-branes, and hence all the problems of 
membrane theory can, in principle, be removed. 

Let us begin by quantizing the p-brane action, which can be performed, as 
with the string, either in the first or the second-order formalism. In the first- 
order formalism, we begin with the Nambu—Goto action, defined entirely in 
terms of derivatives of X,,. We construct the canonical momentum: 


ga be (13.6.14) 


> 
By an explicit calculation, we find that P“ exactly obeys the following 
constraints: 


PHYO, Xiee0; 

Pi =" det igkhoGe (13.6.15) 
where a = 1, 2,..., p. This means that the Hamiltonian of the system is zero. 
i.e., it consists entirely of the sum of these constraints: 

H =)o (Pi + T7detd,X"9,X,) +A; P“I;X,. (13.6.16) 
In the Gupta—Bleuler formalism, we can take the gauge where 4, = 0 and 


Ay = 1. The problem with the resulting Hamiltonian is that it is quartic in 
the string variables. This is the principle reason why the quantization of the 
membrane remains one of the outstanding problems of M-theory. 

The problem with quartic interactions persists in all gauges. such as the 
light cone gauge. For example, in the second-order formalism, we include the 
contribution from the world volume metric g,,,. We will break-up the world 


volume metric by separting out the zeroth time index. Leta. b = 1.2..... p 
represent the spacelike indices on the world volume, and let i, j = 0. 1, 
2,..., p represent the usual world volume indices. Then 
len bieetltag? |). S00 agel 13.6.17 
Sn = i 4 0. 
—@ "Apu? Rap 


where /7,,, represents the metric over the 2 x 2 spatial coordinates. Notice that we 
have exchanged the (p+ 1)(p+2)/2 components of g,; forthe p+1 coordinates 
w, u“, and the p(p + 1)/2 coordinates h,,. (This particular parametrization 
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is taken from the ADM variables used to quantize ordinary gravity in the 
canonical formalism. h* and w are then the usual shift and lapse functions.) 
By solving for some of the components of the metric tensor, we find, as usual, 
that h,, = 0,X'0,X"'. In this way, we recover the first-order formalism. 

The equations simplify if we let p = 2. If we introduce the notation 


{f, g} = 6° Ou f dng (13.6.18) 


then the gauge-fixed membrane action for the bosonic coordinates is 


I 2 I 1))\2 
s=;farfa | (30x — {w, X'}) =p? x(x", x?) 
(13.6.19) 
If we add in the fermionic variables, then the full Hamiltonian is 


H = [ eo [5 Pr + shew yay aw + 4 {X", X7}{X', x/}], (13.6.20) 


where y, = 0,X'y! and y = (i /2)e"0,X'0,X7y"". 

At this point, we again see all the pathologies of the membrane action. 
First, we notice that the interaction term is quartic in the string variable X’, 
meaning that it is extremely difficult to quantize the theory. The free theory of 
membranes is thus highly nonlinear and basically intractable. This alone has 
discouraged research on membranes. 

Second, and more important, we see that there are directions in which this 
quartic term actually vanishes. Since this term corresponds to the surface area 
of a membrane, we see that it vanishes if the membrane degenerates into a 
long, infinitely thin line with zero area. 

If we imagine that the membrane looks like a porcupine with millions of 
tiny “quills” emanating from it, we see that the wave function of a membrane 
can “leak” out from these quills, and hence the wave function is not stable. 

We also have the problem that the theory is ultraviolet divergent on the world 
volume, which means that quantization of the theory is problematic. 

Some insight into this problem can be seen if we compare this theory with 
the action of ordinary super-Yang—Mills theory defined with only one time 
coordinate. Let us introduce A’ = t“A“!, where t“ is a generator of U(N). 
Then the action becomes 


1 2 ee . 
S=5 far Ne {(DoA’) —1[A‘, A’][A’, A'] tiv Dow + vy'[A’, v]}. 
(13;6221) 
Remarkably, we see a strong resemblance between the two theories. 
Basically, we wish to make the following correspondence between U(N) 

Yang—Mills theory and membranes: 

A’ > Xx’, 
remap ant 


far Tr -> [ee 
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U(N) — w(oo). (13.6.22) 


The key is that we have taken the limit as N goes to infinity. In this limit, 
U(N) becomes U(0oo), which (if we carefully take the limit in a certain way), 
becomes w(oo), which corresponds to area-preserving diffeomorphisms for 
spherical membranes and torii. The two-dimensional coordinate (0), 02) of 
the membrane thus emerges out of the index a of the U() group. 

Now we have a regularization limit in which we can analyze the instability 
of the action. The Yang—Mills theory we have written, for finite N, is unstable, 
in that the quartic term vanishes if A’ takes values in the Cartan subalgebra 
of U(N), since the members of the Cartan subalgebra commute among them- 
selves. Thus, the “quills” have now been replaced by members of the Cartan 
subalgebra. Since the theory is unstable for finite NV, we can now show that the 
full membrane theory is also unstable if we let N — oo. 

Although this result is unfortunate, we now have an entirely different in- 
terpretation from before. In the matrix model interpretation, the membrane is 
seen as merely a certain collection of Dirichlet 0-branes. The instability of 
the membrane is now seen as a simple consequence of the “no-force” con- 
dition usually found for solitons. Thus, the matrix model has given us a new 
interpretation of the membrane as a peculiar bound state of Dirichlet 0-branes. 
(The 5-brane, which also exists in M-theory, is more subtle and is still being 
investigated.) 


13.7. Black Holes 


Because string theory is a quantum theory of gravitational interactions, we 
should be able to use it as a guide to probe delicate questions in black 
hole physics. In particular, there is the long-standing problem of deriving the 
Bekenstein—Hawking area-entropy relationship for black holes [20. 21] using 
statistical methods. Although entropy is rigorously defined using statistical 
methods via S = k log W, this definition requires a detailed understanding of 
the counting of the quantum states of the black hole, which traditionally has 
been beyond the reach of physicists. 

In the past, thermodynamic, rather than statistical. arguments have been 
used to heuristically define an entropy for black holes. First, we notice that, 
by direct calculation, the mass of a black hole is proportional to the area of its 
event horizon, so that 


dM = KdA, (13.7.1) 


87G 


where x is the surface gravity of the black hole. In particular, notice that dA 
is always positive, at least classically. For example, if a black hole absorbs 
a nearby star or two black holes collide, then the mass of the black holes 
expands and the area always increases. Thus, from purely formal arguments, 
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we suspect that there might be a relationship between the area of a black hole 
and its entropy, which should also increase. 

In particular, we wish to exploit the relationship between this equation and 
the laws of thermodynamics 


ie = Tds. (132722) 


where dS is the change in the entropy and d E is the change in energy. Notice 
that we have to define what we mean by temperature for a black hole. 

At first, defining a temperature for a black hole may seem odd, since nothing, 
by definition, can escape a black hole, since its escape velocity equals the speed 
of light. Although black holes have classical gravitational fields strong enough 
to absorb any stray pieces of cosmic matter, they must also obey the laws of 
quantum mechanics, which allows for quantum tunneling through the potential 
barrier. For example, the gravitational fields near a black hole are strong enough 
to convert virtual pairs, which are created in the vacuum, into real ones. In the 
same way that electron—positron pairs can be created from the vacuum via 
extremely strong electric fields, the black hole’s gravitational field is strong 
enough to create particle production from the vacuum. 

Physically, this means that quantum black holes must slowly radiate or 
“evaporate” with time, leaking their energy into space. It can be shown that 
the characteristic radiation emitted from such a black hole is that of a black 
body radiator with a definite temperature. Black holes can therefore be treated 
as thermodynamic objects with a well-defined temperature. We find 


T =K/2n. (138) 


This yields the Bekenstein—Hawking relationship, which states that the 
entropy S of a black hole is equal to one-fourth the area of its event horizon 


Se tA. (13.7.4) 
The analogy with thermodynamics becomes even stronger for spinning black 


holes. If a black hole has angular momentum J and the angular velocity of the 
horizon is Q2, then we have 


dM = (k/8m)dA+QdJ, (Sis) 
which has a striking resemblance to the usual law of thermodynamics 
aE =Jds —Pdv. (13.7.6) 


The defect in this derivation is that the arguments are only semiclassical 
and thermodynamic, while a complete derivation would necessarily count the 
quantum states of the black hole. Since the entropy is usually defined statisti- 
cally, there have been attempts over the years, none of them rigorous, which 
have tried to derive this relationship from statistical counting arguments. 

Recently, M-theory methods have been used to shed light on this problem 

_[22]. Since BPS states are found frequently when analyzing black holes, it is 
logical that we can use them to calculate their entropy. We know that D-branes 
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in particular can act as sources for R-R fields, and that the counting of D-branes 
for certain cases can be established. 

For many configurations of black holes, we find that the area of the event 
horizon is zero, depending on the charges of the black hole. In order to have a 
nonvanishing area, we start with the Type IIB action in five dimensions 


L 
S=— i e” f dbx/= g{R—(Vdy — SVD 
2D 
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Notice that there are three U(1) gauge fields: G = dA, which corresponds to 
the usual Kaluza—Klein field, (H4),.» = Ayys, and H_ = e°®+?*H, where « 
corresponds to the five-dimensional dual. 

We can then define three charges, two R-R (electric and magnetic) charges 
and one K-K charge 


1 
of >| e Pts (13.7.8) 
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In five dimensions, we can find a simple solution to these equations re- 
sembling a Reissner—Nordstrom black hole (a black hole with electric charge). 
Unfortunately, this black hole solution has zero area for the event horizon when 
the momentum P 1s zero. (In order to find a black hole solution with a nonzero 
area, all three charges, including the momentum P, must be nonzero.) This 
means that we must perform a boost of the Reissner-Nordstrom black hole to 
momentum P, parametrized by a variable ao. Then the boosted five-dimensional 
Reissner—Nordstrom solution is given by 


ee [ x a 7 come) ne 
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where r andr, represent the inner and outer event horizons of the black hole. 
Wheno = 0, we obtain the usual generalized Reissner Nordstrom metric. We 
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will be interested in the case where the black hole is“extremal.” (For a black 
hole with charge, we have the relationship between the mass M and the charge 
Q: M > Q, in suitable units. For an extremal black hole, M = Q. For our 
case, we have an extremal black hole when ry. = r_ = ro.) 

For this metric, we find the following charge relationship 


Q\05=rir°V/g". Glew) 
The momentum is given by 
RV. 
P= 3g? sinh (20 )(r? — r?). (13.7.13) 


(We will shortly take the limit 0 — oo as r_ — r,, such that P remains 
finite.) 
Then the energy E and area A become 


RV 
a 2g? [205 +r) + cosh*a(r>, — r?)), (13.7.14) 


A =4n'r? R cosh o,/r? — 2. (1327.15) 


(Notice that the area of the event horizon A vanishes if we take r_ > r, for 
finite 0, which forced us to make the boost to obtain a nontrivial result.) 
The area of the event horizon can be simplified as 


A = 82./0,05PR. (13.7.16) 


The Hawking radiation temperature for this metric can be shown to be 


2 ee 
Sess es (Ba 
2mri.cosh 0 
Putting everything together, we find 
A riVPR 
Se rik 20 = 27+ 0, OsPK. (13718) 


Since*P = n/R on the compactified circle, we find our final result for the 
entropy 


S28 yO Osn. (13.77.19) 


So far, all of our arguments have been semiclassical, totally ignoring the 
difficult problem of actually computing the statistical mechanical origin of 
the entropy. All we have done is recast known results in terms of the string 
background equations. 

The crucial step is that BPS states at strong coupling (where the black hole 
picture is valid, but where we do not know how to count states) should also hold 
for weak coupling, where the D-brane states can be counted explicitly. We focus 
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on the Dirichlet 1-brane (a string) and the Dirichlet 5-brane compactified on a 
small four-torus of volume (277)*V and a large circle given by circumference 
am. 

If we wrap the Dirichlet 1-brane around a circle of radius R, the string carries 
a charge Q; = |. Likewise, wrapping a Dirichlet 5-brane around a four-torus 
gives us a charge of Qs; = 1. By wrapping multiple D-branes around either 
the circle or the four torus, we can generate field configurations of arbitrary 
charges Q, and Q). 

As we mentioned earlier, the interactions of D-branes is given by open 
strings which stretch between them. The counting of states is then dominated 
by the open strings which terminate on Q, Dirichlet 1-branes and Qs Dirichlet 
5-branes. For the moment, let Q; = 1. Then the states of the theory are 
dominated by open strings which are free to move in the other four dimensions 
of the four-torus. This is the key to the entire problem: the counting of quantum 
states reduces to counting Q, ordinary strings which are free to move in four 
dimensions, i.e., the states include 4Q, bosonic strings and their fermionic 
partners. Fortunately, the counting of string states is well known. We recall, 
for ordinary string theory, that the degeneracy of harmonic oscillator states for 
a conformal field theory at level Lo = n, for large n, and for central charge c 
is given by 


d(n,c) > exp(27./nc/6) (37-20) 


for large n. 

Thus, we need to calculate c for these Dirichlet 1-branes. In a simple con- 
formal field theory with Ng species of bosons and Nr termions, c is given 
by 


c=Ng+3Nr, (13.7.21) 
where Ng = 4Q,. We thus find that 
d(n,c) > exp(Q27V/ Qin) (3322) 


for Os = 
But we know that the system is dual between the 1-brane and 5-brane. Thus, 
the final result should be written in terms of the product Q; Qs. Therefore we 


can relax the condition Qs = |, and write the answer in terms of the product 
Q, Qs. Thus, the final answer is given by S = log d, or 
S23 2n/ OO sn, (137.23) 


which agrees precisely with the previous result, thereby confirming that § = 
A/4. 

We see that the key to this calculation was the fact that the counting of 
statistical states was dominated by counting the states of a Dirichlet 1-brane 
(a string), which is well known. Although this result was derived for a five- 
dimensional black hole using Type IIB strings, the result is probably quite 
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general and has been shown to hold down to four dimensions for a variety 
of compactifications and backgrounds. (In four dimensions, this calculation 
requires the introduction of new objects, such as Kaluza—Klein monopoles.) 
Similarly, the calculation can be carried out for rotating black holes. 


13.8 Summary 


So far, we have seen the pervasive influence of p-branes throughout M-theory, 
which have allowed us to determine the nonperturbative behavior of many 
string theories. In this chapter, we construct the explicit representations of 
these p-branes. 

For example, we emphasize that we are forced to include the presence of 
these p-branes in string theory. Even if we originally start with a theory purely 
of strings, eventually we must solve for the equations of motions, which allow 
for solitons and solitonlike objects. Specifically, we can write the membrane 
solution of 1 |-dimensional supergravity by introducing the following ansatz: 


2 k; a“ ks : 2 2 2 
ds* = eee dx" dx, + ees (dy* + y?dQ5), (13.8.1) 


where p = 0, 1,2, m =3,..., 10, dQ; is the volume form for the S7 sphere, 
and the four-form field strength is proportional to the dual of the volume form 
on 57. 

Similarly, the 5S-brane solution of 1 1-dimensional supergravity can be written 
as 


’ kg -1/3 h ke 2/3 : 5 ; 
ds* = Les dx" dx, + La (dy a dO4)\ en(1G38:2) 


and the four-form field strength is proportional to the volume form on $4. 
Not surprisingly, we can now present the general case of the p-brane solution 
in D dimensions. We start with the action 
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where F’,;2 is the usual antisymmetric field strength. 
We couple this to the p-brane action 


Dp — 7 f ars (=$ v7 v7 x"9jX" eunesorr a 
] 
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where y/ is the metric on the world volume of the p-brane. 
To solve these coupled equations, we start with the ansatz 


ds? = e*4dx" dx, + e dy” dy”, (13.8.5) 


pti 


giilavdpst 9, x" _ aap, fla 1 Aut ott) . (13.8.4) 
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where up = 0, 1,2,..., pandm = p+ 1,p+2,..., D—1. Similarly, we 
split the p-brane coordinates as follows: (XM SXk ok” and’ Xr Fh pand 
Y™ = constant. 

Then a solution can be given as 


Oe at ee 
XD ay (C7 Ah!) 
eee ae 
B= 75 (C — ado/2), 
a a’ 
5b = {(C — ado/2) + ago/2, 
14-2 1D p= 3) — 2), (13.8.6) 
and 
x k 
eabo/2 1 ae D—p-3>0, 
oe vy (13.8.7) 


2 
ie 
eee? — in y, D—p-3=0, 


where k = 2«?T/(D — p — 3)Qp-_p-2 and & is the volume of a hypersphere. 

So far, we have only constructed classical solutions corresponding to the 
membrane and 5-brane. To construct the explicit supermembrane action, we 
introduce the generalized Nambu-—Goto term 


5} = -T a ay —det I]; : Nn (13.8.8) 


where I], = 0X" —iOT "9,0. This term by itselfis not locally supersymmetric, 
sO We must introduce a Wess—Zumino term. We first introduce the form /?: 


i a 
= pee ce DMO ede. (13.8.9) 
Writing h = db, we can then write the Wess—Zumino term of the action as 
S,= -ar | *b=-— aE [ artactoiny j (13.8.10) 
z (p me 1)! CLs p4t * poe 


Written out explicitly, the Wess-Zumino term for the membrane is 
i aye a 
S. = -— [@o {(e*6P,,9,6) [TE My + i46P’3,6 (13.8.11) 
— (4)(6T*9;6)6r’46)]} , (13.8.12) 


l ” 
a — i — ft I: a ijk Lh uv o 
PMT Ty = MMT. 0 = epee TET 


hp: 


(1358-13) 
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We see therefore that 6S = 0 if we choose 
(eas oat I 66 = LP )e(@); (13.8.14) 


where (1 + I) is a projection operator. 

The 11-dimensional membrane and the 5-brane are sometimes called the 
M-branes of M-theory. However, we are also interested in the lower p-branes 
found in the BPS states. These include the D-branes. 

A Dirichlet p-brane (i.e., Dirichlet p-brane) can be defined as a p-brane 
on which open strings can end. The action for these D-branes differ from the 
usual one, in that we can also add a vector field A which propagates on the 
world volume. The addition of this vector field changes the usual counting of 
the brane-scan, which now becomes 


D-brane:D — 2 = 4MN. (13.8.15) 


It is easy to see that the various BPS p-branes found by decomposing the 
supertranslation algebra can be satisfied by this condition. 

Historically, D-branes were first discovered by making a T-duality transfor- 
mation on open strings. We recall that a T7-duality transformation on a closed 
string results in 


XiG) XG) 4, 1.0) (13.8.16) 


However, for an open string, this same 7-duality changes the Neumann 
boundary condition for X,, into a Dirichlet boundary condition for X,,: 


d,X,=0 —-> 4X, =0, : (13.8.17) 


or X » = 0 at the endpoints, where we y is the dual string. 

The action for the D-brane can be found in several ways. As with the ordinary 
closed string, we can first couple the open string to the massless fields (a vector 
field and a fermion) and then integrate out the higher string degrees of freedom. 
This gives us the Beta functions, which are then set to zero. Then it is an easy 
matter to construct the action whose equations of motion are precisely B = 0. 

The resulting action for the massless fields is 


. [ey —det(G;, + ay, = B;;); (13.8.18) 


where f" = X* atthe boundary of the p-brane, and G and B are the usual terms 
formed from the pull-back to the membrane surface, i.e., Gi; = 0; f" 0; f" Qyuv- 

We can also derive the D-brane action in another way, by taking the |1- 
dimensional membrane action and then compactifying it ona circle. We obtain 
the same Dirac—Born-Infeld action. 

One of the most surprising results of D-brane physics is the M(atrix) model 
conjecture, that M-theory in the infinite momentum limit is equivalent to 
Dirichlet 0-branes in the large N limit. Since M-theory is known to contain 
membrane and 5-branes, this is a truly remarkable simplification, since the 
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action for Dirichlet 0-branes is given by a super-quantum—mechanical system, 
which is not even a quantum field theory. 

The motivation for this result can be seen by analyzing the super-translation 
algebra for M-theory. We recall that after compactification, the momentum 
operator Py becomes p, and pj;, where 


Piu=Z=N/R. (13.8.19) 


In 11 dimensions, this last term is the usual Kaluza—Klein state. But in 10 di- 
mensions, this corresponds to the Dirichlet 0-brane state, which now dominates 
the algebra in the infinite momentum frame. 

There are several tests of this conjecture. The simplest is to analyze the 
spectrum of the theory. The super-Yang—Mills theory has a 16-component 
spinor, which can be used to derive 2° = 256 states, which correspond to the 
massless states of supergravity. 

Second, we can explicitly calculate Dirichlet 0-brane scattering amplitudes 
and show that they match those of supergravity scattering. The action for N 
coincident D-branes is given by the super-Yang—Mills theory reduced to the 
world volume of the p-brane 


S= T [aro Tr (—} Fu, FY +iWl’ D,W) +---. (13.8.20) 


What is highly unusual is that this action is also applicable to multi-D-brane 
fields which are not coincident, so this action can be used to describe D-brane 
scattering. We use this fact to calculate Dirichlet 0-brane scattering. We begin 
with the action for N Dirichlet 0-branes 


CZ r fat Tr (§(Dox'? — 167 Do + 4 ([X'. XJ) +07 y![X,.6]). 
(iS 829 
This action not only describes N overlapping Dirichlet 0-branes, it also 
describes the scattering process of several Dirichlet 0-branes. For example, if 
the gauge field is block diagonal, obeying the the gauge group 


U(N) @ U(N2) @ --- @ U(N,), (13.8.22 


then we can use this to represent multi-D-brane scattering. 

This can be used to explicitly solve for two Dirichlet 0-brane scattering. We 
first power expand the X field as follows: X' = B' + J/2Y' where B' labels 
the classical configuration and Y' represents the quantum fluctuations. 

The power expansion yields 


Sy = if ac (SY (05 — r?)¥j + LY}? — r*yyj + Lyiazy! 


— fge* ee Biylyiys — 5 ate gos eIviviv;). @aise23) 
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If A is the zeroth component of the gauge field, then we can similarly write 
the action for this field 


Sa=i | dt (3A1(87 — r?)A; + $A2(8? — r?)Ar + 4A307A; 
+ 2639, BiAY! + /pe%”d, Y'A,Y! 
— fge™* eo BIA A,Y! — Se eet AYLACY)). (13.8.24) 


It is now a simple matter to functionally integrate out the Y' quantum fields to 
lowest order. Since the B’ are classical fields, the above action can be written as 
the sum of quadratic, cubic, and higher terms. The quadratic terms can also be 
diagonalized so they represent the propagation of free particles with a certain 
mass. For a single field, the integration over the quadratic part yields the usual 
determinant 


= det(—8, +r? +a). (13.8.25) 


By including all these massive fields, we find that the functional integral is 
modified by the determinant 


De=— DD ape Dp) De (13.8.26) 


By explicitly decomposing the above determinants, we can then extract the 
contribution of these determinants to the action. We find 


15 4 6 
ee ao) (=) (13.8.27) 


16r7 ri 


which matches the result found in ordinary supergravity scattering. 

Lastly, we find that D-branes allow us to solve a long-standing problem 
in black hole physics, which is to find a statistical mechanical derivation of 
the Bekenstein—Hawking radiation formula, which states that the entropy of a 
black hole is equal to the one-fourth the area. 

The fact that the entropy of a black hole is proportional to the area is related 
to the fact that the black hole gobbles up new interstellar matter, which makes 
its area increase, rather than decrease, in size. Like entropy, the area of a black 
hole always increases. Although thermodynamic arguments have been made 
which plausibly give rise to this relation, a statistical derivation necessarily 
involves a quantum theory of gravity, which was always beyond reach. 

The introduction of D-branes changes all of this, allowing us to count the 
quantum states in a black hole. Let us start with a Type IIB theory compactified 
down to five-dimensions. The effective action for the fields are 


Ses = dx./—8 {Re R~ (V9)? — (VY 


_ 1,-2D426 772 _ 1,-2D-26 py? _ 
ae Join AG fale 


(13.8.28) 


542 13. Solitons, D-Branes, and Black Holes 


We can integrate over the fields to obtain three charges which describe the 
black hole 


OT =| Ci tie (13.8.29) 
Re 
0. = | f .-0-% oe, (13.8.30) 
4r? S; 
Pees =/ > *G, (13.8.31) 
1G 16m S3 


In general, we need all three charges to be nonzero in order to obtain a 
nonzero area for the black hole. For the Type IIB string, there are D-branes 
in odd dimensions, specifically Dirichlet 1-branes and Dirichlet 5-branes. If 
we compactify on a four-torus and a circle, we can wrap the Dirichlet 1-brane 
around the circle (giving us Q, = 1) or the Dirichlet 5-brane around the toril 
(giving us Qs = 1). Let Qs = 1. Let us wrap Q, Dirichlet 1-branes around 
the circle. Then we effectively have Q, Dirichlet 1-branes (strings) which are 
free to oscillate in four dimentions. Thus, we have a sigma model defined in 
4Q, dimensions. 

Fortunately, we know how to count the number of states in a string theory. 
For large n, the degeneracy of states of the string is given by 


d(n,c) — exp(27./nc/6) (1388.32) 


for large n. For our case, c is equal to Ng + 1N,-, where Ng = +Q;. But since 
the system is dual, the charges must come in the combination Q;Q<. so we 
can relax the condition QO; = 1. 

Since S = k log W, we can now put everything together and get the entropy 


S Sy Oar. ( 1 SSsee)) 


which is precisely one-fourth of the area. 

Although we chose Type IIB theory compactified down to 5 dimensions. 
this result has been generalized to a wide variety of other kinds of black holes. 
such as four-dimensional black holes and spinning black holes. 


13.9 Conclusion 


At present, string theory and its recent reformulation, M-theory, provides the 
most promising hope for a unified theory of all physical forces. Not only does 
string theory combine gravity with the other quantum forces, it also is rich 
enough, in principle, to explain all the symmetries found in nature. In this 
respect, the theory has no rivals. There are, however. a number of stringent 
tests that the theory must pass: 
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e The outstanding problem of string theory is to find the correct vacuum 
which describes our physical universe. Among the millions of vacua dis- 
covered in four dimensions, one must be found which can reproduce the 
Standard Model with symmetry SU(3) ® SU(2) @ U(1) coupled to gravity. 

e Many of the marvelous properties of superstring theory come from local 
supersymmetry, but eventually supersymmetry must be broken if the theory 
is to describe the physical universe. So far, no convincing method has been 
proposed to yield supersymmetry breaking. 

e The theory must explain the value of the cosmological constant, which 
is known to be vanishingly small. This problem is closely tied to super- 
symmetry breaking, since the cosmological constant is zero in a locally 
supersymmetric theory. 


The key to all of these problems may lie in a nonperturbative formulation 
of the theory. At present, each of the various proposals has problems. Confor- 
mal field theory, for example, is basically a perturbative formulation of string 
theory, and cannot give us nonperturbative information. String field theory, al- 
though it is defined independent of perturbation theory, has proven too difficult 
to formulate and solve. 

So far, our best hope has been with M-theory. From an | 1-dimensional per- 
spective, all five superstring theories can now be viewed as different vacua of 
the same theory. A vast web of duality relations has established a nonperturba- 
tive link between different string theories. The strong coupling limit of many 
string theories is now revealed to be another string theory. 

Furthermore, the proliferation of D-branes in the theory has given us insight 
into the nonperturbative spectra of the theory, allowing us to categorize the 
BPS states and calculate the quantum properties of black holes. 

There are still enormous hurdles for M-theory. Some of the outstanding 
problems include: 


Establishing its complete structure in 11 dimensions. So far, all we know is 
that it reduces to | 1-dimensional supergravity in its low-energy limit. 
Quantization of M-theory. Unless M-theory is quantized, we will never under- 
stand its spectra and rigorously understand its properties. Unfortunately, even 
simple membrane actions cannot be quantized with known techniques. 
Understanding its interactions. At present, very little is known about mem- 
brane interactions. Excitations of D-branes may be given in terms of strings, 
but there are large gaps in our understanding of how membranes interact with 
each other. 

Completing the web of dualities down to D = 4. Although a considerable 
amount of work has been done on dualities in D = 8 and D = 8, very little 
is still known about dualities in D = 4 with N = | supersymmetry, which is 
the theory of physical interest. 

Understanding its phenomenology. M-theory still cannot explain why super- 
symmetry is broken, or give us insight into the cosmological constant. 
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However, given the astonishingly rapid rate at which the theory is devel- 
oping, there is considerable optimism that the mystery behind M-theory will 
soon be revealed. Perhaps some of the readers of this textbook will be among 
those who accomplish this feat. 
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Appendix 


Because the mathematics of superstring theory has soared to such dizzying 
heights, we have included this short appendix to provide the reader with a 
brief mathematical understanding of some of the concepts introduced in this 
book. We apologize that we must necessarily sacrifice a certain degree of 
mathematical rigor if we are to cover a wide range of topics in this appendix. 
However, the interested reader is advised to consult some of the references 
listed later for more mathematical details. 


A.l_ A Brief Introduction to Group Theory 


A group G is a collection of elements g; such that: 


(1) There is an identity element /. 
(2) There is closure under multiplication: 


Sil 2S 3p = [82a 


(3) Every element has an inverse: 
8 X 8; b= 
(4) Multiplication is associative: 
(81 X 8j) X 8k = 8 X (8; X 8). 


Groups come ina variety of forms. Specifically, we have the discrete groups, 
which have a finite number of elements, and the continuous groups, such as the 
Lie groups, which have an infinite number of elements. Examples of discrete 
groups include: 
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(1) The Alternating groups, Z,, based on the set of permutations of n objects. 

(2) The 26 sporadic groups, which have no regularity. The largest and most 
interesting of the sporadic groups is the group F;, commonly called the 
“Monster group,” which has 


gt 78 Pe © 13° 17 19323 29 ee 
elements. 


In this book, however, we mainly encounter the continuous groups, which 
have an infinite number of elements. The most important of the continuous 
groups are the Lie groups, which come in the following four classical infinite 
series A, B, C, D when we specialize to the case of compact, real forms: 


An = SU(n + 1), 

B, = SO(2n + 1), (A; Val 
C, = Sp(2n), 

D, =S0eGw. 


as well as the exceptional groups: 
Ga; Fa, Beek es. Oe) 


of which E, and Ex are the most interesting from the standpoint of string 
phenomenology. 

Let us give concrete examples of some of these groups by analyzing the 
set of all real or complex n x n matrices. Clearly, the set of arbitrary n x n 
invertible matrices satisfies the definitions of a group and hence is called the 
group GL(n, R) or GL(v, C). The notation stands for a general linear group of 
n Xn matrices with real or complex elements. If we take the subset of GL(. R) 
or GL(n, C) with unit determinant, we arrive at SL(7. R) and SL(n. C), the 
group of special linear n x n matrices with real or complex elements. 


O(n) 


Now let us take a subgroup of GL(n. R), the orthogonal group O(n), which 
consists of all possible # x n real invertible matrices that are orthogonal: 


0x o' =1. (A.1.3) 


This obviously satisfies all four of the conditions for a group. Any orthogonal 
matrix can be written as the exponential of an antisymmetric matrix: 


0) a Ae (A.1.4) 


It is easy to see that 


i 


O'=e4 =e4= 9", (A.1.5) 
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In general, an orthogonal matrix has 
in(n — 1) 


independent elements. Thus, we can always choose a set of n(n — 1) linearly 
independent matrices, called the generators 4;, such that we can write any 


element of O as 
(1/2)n(n—-1) 
Sc) OSs 0>,| (A.1.6) 


iI 


The real numbers p’' are called the parameters of the group, and there are thus 
inn — 1) parameters in O(n). The number of parameters of a Lie group is 
called its dimension. The commutator of two of these generators yields another 
generator 


Pil 2b (A.1.7) 


where the f’s are called the structure constants of the algebra. Notice that the 
structure constants determine the algebra completely. 

Notice that if we take cyclic combinations of three commutators, we get an 
exact identity 


eG l= 0: (A.1.8) 


By expanding out these commutators, we find that the combination identically 
cancels to zero. This is called the Jacobi identity and must be satisfied for the 
group to close properly. By expanding out the Jacobi identity, we now have 
a constraint among the commutators that must be satisfied, or else the group 
does not close 


rh ap (A.1.9) 


Of course, the set of orthogonal matrices closes under multiplication. A 
more complicated problem is to prove that this particular parametrization of the 
orthogonal group, with generators and parameters, closes under multiplication. 
Let us write 


e*eF = &€, (A.1.10) 


Fortunately, the Baker—Hausdorff theorem shows that C equals A plus B plus 
all possible multiple commutators of the A and B. But since the A and B 
satisfy the Jacobi identities, the set of all possible multiple commutators of 
A and B only creates linear combinations of the generators. Thus, the group 
closes under multiplication. 

Notice that the structure constants of the algebra form a representation, called 
the adjoint representation. If we write the structure constants as a matrix: 


See (A.1.11) 


Thus, the structure constants themselves form a representation. 
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We can always choose the commutation relations to be 
[M??, M“] = 6% Mm" = 644 mM" ae §°4 me" _ 5° M4 (AT 
for the antisymmetric matrix M?’. 
One convenient representation of the algebra is now given by 
(M*);; ~ 678; — 8755 (A.1.13) 
which, we can show, satisfies the commutation relations of the group. 
Let us define a set of n elements x; that transforms as a vector under the 
group 
40, %;. (A.1.14) 
In general, we can also define a tensor 
tT, see (ALT IS} 


of rank P that transforms in the same way as the product of P ordinary vectors 
Soe: 


(A.1.16) 


In addition to the vector and tensor representations of O(N), we have the 
spinor representation of the group. Let us define the Clifford algebra: 


(P?, Foy 25”. (A.1.17) 


Now define a representation of the generators in terms of these Clifford 
numbers 


Ty tascntt = Ov, m1 Ov uw EY Ove.up Tuy .u2 oe [ps 


M? — Vis Fd. (ALS) 
i 
The Clifford numbers themselves transform as vectors 
[M? Vr] = i(6"r? — 6”T*). (A.1.19) 
In general, these Clifford numbers can be represented by 2” x 2” matrices 
(Tap (A.1.20) 


for the group O(2n). Therefore, a spinor y, that transforms under O(271) has 
2” elements and transforms as 


ve = (eM) as, (A.1.21) 
where the M’s are written in terms of the Clifford algebra and the p variables 
are parameters. 

For the group O(2n + 1), we need one more element. This missing element 
is 
pee Sele oe (A.1.22) 


We can easily check that this new element allows us to construct all the M 
matrices for O(2n + 1). 
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Let us now try to construct invariants under the group. Orthogonal 
transformations preserve the scalar product x;x;: 


x;X; = invariant. (A.1.23) 
biog = On x7. then 
x, == XO) Ox = xix (A.1.24) 
This invariant can be written 
Oy. (An125) 


where the metric is 4;;. In principle, we could also have a metric with alter- 
nating positive and negative signs along the diagonal, n;;, which would create 
a parameter space that is noncompact. If we have N positive and M nega- 
tive elements in 7;;, then the set of matrices that preserve this form is called 
O(N, M): 
(O7)ijnjxOu = nits 
nj = (1)d;;, (A.1.26) 
ei) =r], 
If all the elements of ¢ are positive, this gives the group O(n). If the signs are 
alternative, then the group is noncompact. Special cases include 
projective group: O..1); 
Lorentz group: OG; 1); 
de Sitter group: O(4, 1), ARIZ) 
anti-de Sitter group: O(3, 2), 
conformal group: O(4, 2). 
For example, the de Sitter group can be constructed by taking the generators 
of O(4, 1) and then writing the fifth component as 


P? ~ M4, (A.1.28) 
Thus, the algebra becomes 
q [P?, P”] a M?, 
OCT a Sh ea ier cae (A.1.29) 


[M??, M7] = nM ae 
Notice that this is almost the algebra of the Poincaré group. In fact, if we make 
the substitution 
Po oscrP, (A.1.30) 


then the only commutator that changes is 


l 


fee, (A.1.31) 
i 
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r is called the de Sitter radius. This means that if we go around a circle in 
de Sitter space and return to the same spot, we will be rotated by a Lorentz 
transformation from our original orientation. Notice that ifr goes to infinity, we 
have the Poincaré group. Thus, r corresponds to the radius ofa five-dimensional 
universe such that, ifr goes to infinity, it becomes indistinguishable from the flat 
four-dimensional space of Poincaré. Letting the radius go to infinity is called 
the Wigner—Inonii contraction and will be used extensively in supergravity 
theories. After the contraction, the de Sitter group becomes the Poincaré group. 


SU(n) 


The group SU() consists of all possible n x n complex matrices that have unit 
determinant and are unitary: 


UU =. (A.1 32% 


The notation stands for special unitary 1 xn matrices with complex coefficients. 
Any unitary matrix can be written as the exponential of a Hermitian matrix 
A = A: 


Wieen. (A.1.33) 
We can show that 
Ut = et — tH — yl, (A.1.34) 
Let n elements in a complex vector u, transform linearly under SU(11): 
=U; ae. (A.1.35) 


The n complex vectors u; generate the fundamental representation of the group. 
Then an invariant can be constructed: 


u;u; = invariant. (A.1.36) 
If u; = U;;u;, it is easy to check that 
uu, = ut(U");Ujpug = utu;. (A357) 


The metric tensor for the scalar product is again 4,,. If we were to reverse 
some of the signs in this diagonal matrix, the groups that would preserve this 
metric are called SU(N, M). Anexample of this would be the conformal group: 
SU@;2). 

Any n x # complex traceless Hermitian matrix has n° — 1 independent 
elements and hence can be written in terms of 7° | linearly independent 
matrices 4;. Thus, any element of SU(n) can be written as 


n*—| 
U =exp|i >> pA; |. (A.1.38) 


| 
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The Baker—-Hausdorff theorem then guarantees that the group closes under this 
parametrization and that we can write the algebra of the group as 


(27 at AL. (A.1.39) 


Again, knowledge of the structure constants determines the algebra completely. 
We can also construct representations of SU(n) out of spinors. If we have 
the group O(2n), then SU(”) is a subgroup. If we construct the elements 


Ad = S(04e S71), (A.1.40) 


where -’/ are Grassmann variables, then the generators of SU(n) can be written 
as 


a > ANA) A‘ (A.1.41) 
ALS 


Thus, we have an explicit representation for the inclusion 


SU(n) C O(2n). (A.1.42) 


Sp(2n) 


The symplectic groups are defined as the set of 2n x 2n real matrices S that 
preserve an antisymmetric metric 77: 


(S7):jn jx Sea = Nil, ; (A.1.43) 
where 
u; = Sijltj (A.1.44) 
and 
0 1 CeO 
—1 0 0 O 
nij = 0 O 0 l (A.1.45) 
7 0 oO -1 0 
Accidents 


Fortunately, there is a series of “accidents” that allow us to make local 
isomorphisms between groups. For example, O(2) is locally isomorphic to 
U(1): 


O20). (A.1.46) 
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To see this, we simply note the correspondence between a matrix element of 
O(2) and U(1): 


pa en re (A.1.47) 
—sin@ cosé 


Thus, they have the multiplication law 6; + 6) = 63. 
Another accident is 


0(3) = SU(2). (A.1.48) 


The easiest way to prove this is to note that the Pauli spin matrices 0, are 2 x 2 
complex matrices with the same commutation relations as the algebra of O(3). 
Thus, 


erin Ai Fi edini (60; (A.1.49) 


where the matrix on the left is a 3 x 3 orthogonal matrix and the one on the 
right is a unitary matrix. 
Another useful accident is 


O(4) = SO(2) ®@ SU(2). (A.1.50) 


To prove this, we note that the generators M'/ of O(4) can be divided into two 
sets: 


A = {Mj2. + M34, M3; + Mo4, Mo3 + Mia} (A.1.51) 
and 
B = {M)2 — M34, M3; — Mra, M23 — Mya}. (Ant52) 


Notice that the A and B matrices separately generate the algebra of O(3) and 
that 


[A, B] 0. (A.1.53) 


Thus, we can parametrize any element of O(4) such that it splits up into a 
product of O(3) and another commuting O(3). Thus, we have proved that an 
element of O(4) can be split up into the product of two elements of a commuting 
set of SU(2) groups. 
Unfortunately, these accidents are the exception, rather than the rule, for Lie 
groups. We list some of the accidents: 
dimi=.3 
SU, c) ~ SOG, 7) ~ Usp?) ~ Ulloa) = Siig), 
SU(I, 1;c) ~ SO@PIyr) ~Se@. 7) ~ Sites). (Ac154) 
dim = 6 
SO(4, r) ~ SU(2, c) ® SU(2, c), 
$O0*(4) ~ SU(2, c) @ SL(2, r), 
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SOG. Isr) = SL(2,c), (ales) 
SO@i2 7) ~ SL, 7) @ SLO.) 
dim = 10 
SO(5, r) ~ Usp(4), 
SO(4, 137) ~ Usp(2, 2), (A.1.56) 
SO(3, 2;r) ~ Sp(4, r). 
dim = 15 


SO(6, r) ~ SU(4, c), 
SO(5, 1;r) ~ SU*(4) ~ SLQ, q), 
5O"(6) ~ SUG; Iic), (A.1.57) 
S@@ 2:7) ~ SUC, 2: c), 
SO(3, 3;r) ~ SL(4, r). 
For arbitrary NV, we have 
SUCNsg) = SU (2N), 
U(N, g) = Usp(2N), 
Sp(N, q) = Usp(2N), (A.1.58) 
O(N, gq) = SO*(2N). 
Also, for N < 6, we have 


Spin(3) = SU(2), 
Spin(4) = SU(2) ® SU(2), 
Spin(5) = Usp(4), (A.1.59) 
Spin(6) = SU(4), 
where SL(n) is the set of all n x n matrices with unit determinant that can have 


real, complex, or quaternionic elements g. Quaternions are generalizations of 
complex numbers such that any element can be written as 


3 
q= oc, (A.1.60) 
i=0 
where the c’s are real numbers and 
Gal. 
die =—I, 
91192 = —4291 = 93; (A.1.61) 


1293 = —9q2 = 11, 
Bq = —9193 = Q.- 
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Cartan—Weyl Representation 


In general, the exceptional groups do not have such a nice representation in 
terms of subgroups of GL(n). We will instead use the methods of Lie and 
Cartan. 

Among the generators 4; of a group, let us select out those elements H; 
which are all mutually commuting: 


[H;, Hj] =0. (A.1.62) 


This is called the Cartan subalgebra. The number of elements in the Cartan 
subalgebra is called the rank r of the group. Let us call all the other elements 
of the algebra E. What are the commutation relations between H and the 
other elements E? In general, we cannot have the commutator between H 
and E yielding another H, because this would not satisfy the Jacobi identi- 
ties. Therefore, the commutator of H and E must yield another E. We can 
always rearrange the rest of the elements E of the algebra so that they become 
eigenvectors of the H’s. We will denote them by 


ji (A.1.63) 
where q is called a root vector in r-dimensional space. In general, these root 
vectors live in r-dimensional space. Notice that if the group has N parameters, 
then there are VN —r elements in E. Thus, there are N — r root vectors living in 


r-dimensional space. We can always take linear combinations of the various 
E’s until we have the eigenvalue equation 


[Hi, Ea] = aj Ea. (A.1.64) 
By the Jacobi identities, the other commutation relations become 
[Eas Bae = Odds, 


[Eau esl = Napears (A.1.65) 
The N’s are the structure constants of the group. They are given explicitly by 
a3 = 3n(m + 1)aja;. (A.1.66) 


The symmetries of the N’s are given by 


Na.p = — Nate = = No gee 
No. = Ng,-a-8 = N-a-6.c- (ARNO?) 


Dynkin Diagrams 


Let us make a few definitions concerning the roots. Each root vector lives in 
an r-dimensional space. Thus, we can choose a subset of them with r elements 
such that any other root can be written as a linear combination: 


pac 0, (A.1.68) 
t=I 
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A root is called a positive root if the first nonzero c in the above equation is 
positive. A simple root is a positive root that cannot be written as the sum of 
two positive roots. Just as the structure constant specifies the algebra exactly, 
the same can be said of the Cartan matrix, which is anr x r matrix defined by 
Jy ee (A.1.69) 
(a jp (04 i 

where the a; are the r simple roots. 

The diagonal elements of the Cartan matrix, by construction, are all equal to 
2. But the matrix is not necessarily symmetric. In fact, it can be shown that the 
only possible values of the off-diagonal matrix elements are 0, —1, —2, —3. 
Because the Cartan matrix defines the group entirely, some simple properties 
of the Cartan matrix can be used to define the group graphically. The most 
convenient of these is the Dynkin diagram. 

For a group of rank r, let us draw r dots. Each simple root is thus represented 
by a dot. Now connect the ith and jth dot with n lines, where n is equal to the 
product of the off-diagonal elements 


ra= AijAji- (A.1.70) 


The resulting diagram is a series of dots connected by a series of multiple lines, 
called the Dynkin diagram. The power of Dynkin diagrams is that they uniquely 
specify the structure of any Lie group, and thus we can visually distinguish the 
various Lie groups. 

Since the elements of the Cartan matrix are related to the scalar products on 
the root lattice space, we can also represent n in terms of the angle 6 between 
two root vectors: 


n 6 

0 90° 
iL AXOF 
2 Wi" 
3) SOF 


If we specialize to compact, real forms, we have the following Dynkin 
diagrams (see Fig. A.1): 
Ap = SUG), 
e; — ej, Lai Fy snk 
B, = SO(n + 1), 
eee e 
+e;, 
C, = Sp(2n), 
coOmeaes, 
+2e;, 
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A, «——e——* see ——) 
B, o_e—__-® . ——— |) 
Qa, On—-1 Oy 
——— ) C—— _} 
C, a a 
a, As a-1 n 
A, 4 
Di? —* 
a, Ao An_2 ey 
n 
(ex ——— } 


FIGURE A.1. Dynkin diagrams for various Lie algebras. The number of dots rep- 
resents the rank of the group. The number of lines connecting the dots depends on 
the angle between two root vectors. Dynkin diagrams give us a convenient method 
of visualizing the entire structure of a Lie algebra. 


D, = S0Cn), 
seen e;, 
Go. 
e; — €;. 
=c(e; + e; — Zen), P<ifsyF#k <3, 
Bigs 
2; GE Oy lzi 27 = 4. 
+2e;. 
ze, te, te, t+ ey, 
E«, 
astreee,, ea fo: 
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g(te; +e) +--+ + es) + (2 — 8)!e,, 
where we have an even number of + signs in the last expression. 
E}, 
+e; + e;, NWS ie a ae 
+/2e,, 
$(te; te, +---+e5)+ (2 — $)'e,, 
where we have an even number of + signs in the last expression. 
Ex, 
ae eee Pere yes 
(te; te, +--+» +e7)+ tes, 


where we have an even number of + signs in the last expression. 
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The most general reparametrization of space-time, a general coordinate 
transformation, is given by 


x" = ¥4(x), (A.2.1) 


Under this reparametrization, we use the chain rule to find that the differentials 
and partial derivatives transform as 


Ox 
dx" = —dx’, 
Ox” 


a ax” @ 

Axe Ax AxY” 

We say that the differential dx” transforms contravariantly and the derivative 
0, transforms covariantly. In direct analogy, we now define vectors that also 


transform in precisely the same fashion: 


(A.2.2) 


ae ; ox” 
ave = ax' Vo 
0 iu 
BY = ~ BY. (A.2.3) 
See 


A tensor Tas simply transforms like the product of a series of vectors. The 
number of indices on a tensor is called the rank of the tensor. 
It is now easy to show that the contraction of a covariant and a contravariant 


tensor yields an invariant: 


A,B” = invariant. (A.2.4) 
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We can show that the partial derivative of a scalar is a genuine vector: 


Ox” 
a. = — do. Aga 
1d = —— 0 (A.2.5) 


The fundamental problem of general covariance, however, arises because 
the partial derivative of a tensor is not itself a tensor. In order to rectify this 
situation, we are forced to add in another object, called the Christoffel symbol, 
which converts the derivative into a genuine tensor: 


VuAy = d,Ay +14, Aa. (A.2.6) 
We demand that 
deb ae 
Vea) = —V, Ay: Auli 
(VuAv) OxeHinoee! ( 


This, in turn, uniquely fixes the transformation properties of the Christoffel 
symbol, which is not a genuine tensor. 

We can, of course, now define the covariant derivative of a contravariant 
tensor: 


Vien = ag PA’, (A.2.8) 


1UA 


as well as the covariant derivative of an arbitrary tensor of rank r. 

So far, we have placed no restrictions on the Christoffel symbols or even the 
space-time. Now let us define a metric on this space by defining the invariant 
distance to be 


ds” = dx" pax’, (A.2.9) 


where the g is the metric, which transforms as a genuine second-rank tensor. 
Now let us restrict the class of metric we are discussing by defining the 
covariant derivative of the metric to be zero: 


Vu8va = 9. (A.2.10) 

Notice that there are 
Dx 4D(D +1) A2tg 
equations to be satisfied, which ts precisely the number of elements in the 


Christoffel symbol if we take it to be symmetric in its lower indices. Thus, we 
can completely solve for the Christoffel symbol in terms of the metric tensor: 
7 = aft 
I wv & Pyewp- 


C pivgs = 4 (Ou 8vp fe Ov8up a OB8uv)- (AZ. 


Notice that we assumed the Christoffel symbol to be symmetric in its lower 
indices. In general, this is not true, and the antisymmetric components of the 
Christoffel symbol are called the torsion tensor: 
— A A 
Toe oer 


uv vu" 


(e213) 
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In flat space, we have the equation 
[ozs ey] 0: (A.2.14) 


Since the derivative of a field generates parallel displacements, intuitively this 
equation means that if we parallel transport a vector around a closed curve in 
flat space, we wind up with the same vector. 

In curved space, however, this is not obviously true. The parallel displace- 
ment of a vector around a closed path on a sphere, for example, leads to a net 
rotation of the vector when we have completed the circuit. 

The analog of the previous equation can also be found for curved manifolds. 
We can interpret the covariant derivative as the parallel displacement ofa vector 
and the Christoffel symbol as the amount of derivation from flat space. If we 
now parallel displace a vector completely around a closed loop, we arrive at 


[Vu Viola, = Rhy Ae, (A.2.15) 
where 


a 
oe 


top eee el ee lol (A.2.16) 


vB pa* 


Let us now try to form an action with this formalism. We first note that the 
volume of integration is not a true scalar: 


d?x = det =| a’ x (A.2.17) 


To create an invariant, we must multiply by the square root of the determinant 
of the metric tensor: 


ox! 
Ox? 


rae det| = (A.2.18) 


The product of the two is an invariant: 
/—gd?x = invariant. (AZ. 19) 


Notice that the square root of the metric tensor does not transform as a scalar, 
because of the Jacobian factor. We say that it transforms as a density. 
Notice that the curvature tensor has two derivatives. In fact, it can be shown 
that the contracted curvature tensor 
Rote = R (A.2.20) 


pva 


is the only genuine scalar we can write in terms of metric tensors and Christof- 
fel symbols with two derivatives. Thus, the only possible action with two 
derivatives is 


= | LOU EER: (A.2.21) 


—|1 
2K? 
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This formalism, however, cannot be generalized to include spinors. If we 

treat the transformation matrix 

ax? 

Ox” 
as an element of GL(D), we find that there are no finite-dimensional spinor rep- 
resentations of this group. Thus, we cannot define spinors with metric tensors 
alone. 

To remedy this situation, we construct a flat tangent space at every point 
on the manifold that possesses O(D) symmetry. Let us define vectors in the 
tangent space with Roman indices a, b, c,d, .... Let us define the vierbein as 
the matrix that takes us from the x-space to the tangent space and vice versa. 


(A.2,23) 


ene = Bus 
e%? = green (A.2.23) 
et et = §7°. 


We can now define a set of gamma matrices defined over either the tangent or 
the base space: 


ya = y", 
{y", vy} = —2g%”. (A.2.24) 
Thus, the derivative operator on a spinor becomes 
Vie a Opa (A.2.25) 
With this tangent space, we can now define the covariant derivative of the 
spinor w 
Viv =a. +00 w, (A.2.26) 


iu 


where o“” is the antisymmetric product of two gamma matrices and oo in 
called the spin connection. Notice that the spin connection is a true tensor in 
the uz index. Under a local Lorentz transformation, the field transforms as 


v > ey, Vive > eMVuw. (A.2.27) 


We can also use the O(3, 1) formulation of general relativity and dispense 
with Christoffel symbols. We can define 


Vu == Ou e we? Me”, (A.2.28) 
where M’s are the generators of the Lorentz group. Then we can form 

[Ver Vile (A.2.29) 
where 


ab b L b 
Rin = 9,00? — Bw? + ww? — ww? (A.2.30) 
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Notice that this tensor R4” yields an alternative formulation of the curvature 
tensor. 


We also demand that the covariant derivative of the vierbein be equal to 
zero: 


vies = One) + Pe + wre? = 0; (A.2.31) 


If we antisymmetrize this equation in wv, the Christoffel symbol disappears. 
Notice that the spin connection has 


D x $D(D — 1) (A.2.32) 


components. This is precisely the number of components in the antisym- 
metrized version of the above equation. Thus, we can solve exactly for the 
connection in terms of the vierbein. The Christoffel symbol and the vierbein 
are very complicated expressions of each other. 

Given these constrained expressions for the Christoffel symbol and the con- 
nection fields, we can now show the relationship between the curvature tensors 
in the two formalisms: 

Rie Re RO). ees (A.2.33) 


pay 
If we take an arbitrary spinor and make a parallel transport around a closed 
circuit with area A?’, we have 


we (1+ AY Reo). (A.2.34) 


py 


Notice that the o*” matrices are the generators of Euclidean Lorentz transfor- 
mations O(D). Thus, after a parallel displacement around a closed path, the 
spinor is simply rotated from its original orientation by an angle proportional 
to 


AY Re 


pv’ 


(A.2.35) 


Notice that we can make an arbitrary number of closed paths starting from a 
single point. Each time, the spinor performs a rotation. Notice that this forms a 
group. In fact, the group is simply O(D), which is called the holonomy group. 


A.3. A Brief Introduction to the Theory of Forms 


Let us define a one-form A by 
A= Ayan, (A.3.1) 
where A,, is a vector field and the differentials dx” are now anticommuting: 
Gt an dx dx", 
dx Naa 0. (A.3.2) 
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Let us define the derivative operator as 


d= dX" Oe (AS) 
Notice that because the derivatives commute 
[0,,, 9,] = 0, (A.3.4) 
so therefore 
dad =0, (A.3.5) 
where d is nilpotent. 
Let us now define a two-form: 
Pe ax’ Nax (A.3.6) 
Notice that the curvature associated with a vector field is a two-form: 
F=dA =dx"o,Ayax (A378 


= 4(0,Ay — 8A,) dx" A dx”. 
Because the d operator is nilpotent, we have 
dF =d’A=0. (A.3.8) 


Thus the Bianchi identity for the Maxwell theory, expressed in terms of forms, 
is nothing but the nilpotency of d. 
A form a is called closed if 


closed: dw = 0. (A.3.9) 
A form w is called exact if 
exact: w=dQ, (A.3.10) 


for some form Q. Thus, the curvature form F is exact because it can be written 
as the divergence of the one-form A. It is also closed because of the Bianchi 
identities. 

We can also combine this with a local gauge group with generators A,,. Let 


A = AjAg ax". (A.3.11) 
Then the curvature form is 
F=dA+AAA. (A.3.12) 
Furthermore, the gauge variation of the Yang-Mills field under 
N= KG (A.3.13) 
is 
SA=dA+AAA—AAA. (A.3.14) 


Inserting the variation of the field A into the curvature F, we find 


SF=FAA-AAF. (A.3.15) 
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Thus, the variation of the action is zero: 
8 Tr(F?) = 2Tr(F A A) —2Tr(A A F) = 0. (A.3.16) 
Let us now write the anomaly term FF in the language of forms. The 
divergence of the axial current is also the square of two curvatures, which is 
also a total derivative. In the language of forms, we find that this is an exact 
form: 
Tr(F A F) = dos, : (A317) 
where 
w3; = Tr(AdA + 2A’) (A.3.18) 
@3 is a three-form which we call a Chern—Simons form. In turn, its gauge 
variation is equal to another form that is also exact: 
6@3 = Tr(dA AdA) = dan, (A.3.19) 
where 
@,) = Tr(A A dA). (A.3.20) 


We also note that these identities apply equally well to Yang—Mills theories 
as to general relativity. For gravity, we have the gauge group O(3, 1). In other 
words, gravity has two gauge invariances, the general covariance of the coor- 
dinates x and the local Lorentz transformations of the tangent space. We will 
explain this more later. 

In general, an N form is defined as 


Ou Omer Nake? As axe”, (A.3.21) 


All of the above equations, of course, can be derived without the use of the 
theory of forms. However, forms give us a powerful shorthand that allow us to 
manipulate complex mathematical quantities. Notice, for example, that Stokes’ 
theorem, expressed in the language of forms, now becomes 


[ do= | w, (A.3.22) 
M aM 


where 3M is the boundary of the manifold M. 
Some simple properties of these forms are 


Oo, = (1) oO, Nw, 
d(wWy A Wy) = dWp A Wy + (—1)’ wp A dary. (A.3.23) 
Let us also introduce the Hodge star operator, which allows us to take the 


dual of a p-form and convert it to ann — p-form in n dimensions: 


eX"! A ee hae) = ae pyle tnst eae ho ae. “(Ned 24) 
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Some properties of the star operation are 
#*Wy = (—1)?" Pw, 
Wp A *Wp = Wg A *Wp. (A:3.25) 


We will devote the next few pages to proving the assertion in Chapter 8 
that an invariant polynomial is both closed and exact. We define an invariant 
polynomial as one that satisfies 


P(a) = P(g™'ag). (A.3.26) 


Let us start by defining a homogeneous invariant polynomial of degree r, 
which is dependent on the forms a;: 


| coin al (SARS GER © (A.3.27) 


Let us differentiate this polynomial, being careful to differentiate each form 
separately within P: 


BP Pttignss dais, 0), (A.3.28) 


l<i<r 
Each time the derivative d passes over a form @;, it picks up a factor: 
da; = (da;) + a;d(—1)”. (A.3.29) 
Now assume that 
a—> g lag. (A.3.30) 
For g close to unity, we can always write 


e— | ae 
6a =—-wWAa+t+aAw. (2331) 


and let us calculate the variation of a homogeneous polynomial P of degree r 
under this shift: 


OP (01; 05,....;C,) = 0 = > EY" [PEi.a.....@ Ao), -. ana 


l<i<r 


— (Pilea, ae. . eT ae am). A.30@2) 
The trick is now to add the contribution of both dP and 5P: 
dP +$5p= a (—1)"*"** Play, a, ..., Daj, ..., @): (A.3.33) 


l<i<r 


Now set each of the a; to be a curvature form that obeys the Bianchi identity 
Da; = 0. Thus, we find 


dP(2)=0 (A.3.34) 


as we claimed. 
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The second part of the proof is a bit more involved. Let us define 


Q=dwo+ordra, 


Q = da' +a! Aa’. (A.3.35) 
Our plan is now to show that 
P(Q’) — P(Q)=dQ (A.3.36) 


for some form Q. 
First we want to write a curvature form that allows us to interpolate 
continuously between Q and Q’. Let 


oO, =o-+1tn, 
n= —o. (A.3.37) 
Notice that the variable f allows us to interpolate between the two forms: 
t(=0 =o) =o. 
(= Te Oy =a (A.3.38) 


It is easy now to find the curvature form that interpolates between the two, as 
a function of f: 


Q, = dw, + a; A ®, (A.3.39) 
where 
t=0—> 2,=22. (A.3.40) 
Notice that the form varies between Q2 and {2’ as t varies from zero to one. 
Now let us write the form q such that 
q(B, vw) =rP(B,a,a---((r — 1)-times)--- a), (A.3.41) 
where the form @ is repeated (r — 1)-times in the invariant polynomial. This 
form q will be the key to showing that P is an exact form. 
By the reasoning given earlier, if we differentiate q, we find 
dq(n, 2,) = rdP(n, Q, -+-((r — 1)-times) - - - 22;) 
= q(Dn, Q,) — r(r — ItP(n, Q An — 7 AQ, 24, ..., 82). 
‘ (A.3.42) 
But we also have the identity, due to the fact that g is an invariant polynomial, 
2q(n An, Q:)+r(r — DP, Q& An— nA Q,..., 2) = 0. (A.3.43) 
In the previous equation, we used 
g=1+n7. (A.3.44) 


Putting both equations together, we find 
dP(S2,) 


A.3.45 
°F ( ) 


dq(n, 2,) = q(Dn, 2) + 2tq(n A n, 82;) = 
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Thus, we arrive at 


“CRC) eae 
/ = P(2')- PO) 


| 
=—d / q(w’ — w, 2,)dt = dQ. (A.3.46) 
0 


We thus obtain our final result: 


P(Q') — P(Q) = dQ, 
Gi — / q(w’ — w, 2,) dt + closed forms. (A.3.47) 
0 


Thus, an invariant polynomial based on curvature two-forms is both closed 
and exact. 


A.4_ A Brief Introduction to Supersymmetry 


In the late 1960s, physicists tried to construct the master group that would 
allow a synthesis of an internal symmetry group (like SU(3)) and the Lorentz 
or Poincaré group. They sought a group M that was a nontrivial union of an 
internal group U and the Poincaré group: 


M2.U- @ PF: (A.4.1) 


Intense interest was sparked in groups like SU(6, 6). However, the celebrated 
Coleman—Mandula theorem showed that this program was impossible. There 
are no unitary finite-dimensional representations of a noncompact group. So 
either: 


(1) the group M has continuous masses; or 
(2) the group M has an infinite number of particles in each irreducible 
representation. 


Either way, it is a disaster. However, it turns out that supergroups or graded 
Lie groups allow for an evasion of the no-go theorem. 

The work of Lie and Cartan concerned only continuous simple groups where 
the parameters p; were real. However, if we allow these parameters to be 
Grassmann-valued, we can extend the classical groups to the supergroups. 

Two large infinite classes of groups we will be interested in are the 
Osp(N /M) and SU(N /M) groups. 

Let us begin with the group O(N ), which preserves the invariant: 


O(N): = x;x; = invariant, (A.4.2) 
and the group Sp(M), which preserves the form: 
Sp(M): = OnCmnO, = invariant, (A.4.3) 
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where the C matrices are real antisymmetric matrices because the 6; are 
Grassmann-valued. The orthosymplectic group is now defined as the group 
that preserves the sum: 


Osp(N/M): — x;X; + OnCmnO, = invariant. (A.4.4) 
Notice that the orthosymplectic group obviously contains the product 
Osp(N /M) > O(N) @ Sp(M). (A.4.5) 


The simplest way to exhibit the matrix representation of this group is to use 


the block diagonal form 
O(N) A 
Osp(N/M) = A. 
sp(N/M) B Sp(M) (A.4.6) 


with simple restrictions on the A and B matrices. 
Similarly, the superunitary groups can be defined as the groups that preserve 
the complex form 


CN 6 C0" gam, 
Sinn = =O mn- (A.4.7) 
The bosonic decomposition of the group is given by 
SU(N /M) > SU(N) ® SU(M) @ U(1). (A.4.8) 
Let us write the generators of Osp(1/4) as 
Ma = (Pu, Muy, Qa) (A.4.9) 
which have the commutation relations 
[M,, Mel = fapMc. . (A.4.10) 
Written out explicitly, the commutators involving the supersymmetry generator 
are 
{Qe, Op} = 2y"C)ap Py 
[Qu, Pu} = 9, 
[Qa Myv] = (Guv)e Qp- (A4.11) 


What we want is an explicit representation of these generators, in the same 
way that 


Py, = —id, (A.4.12) 
is the generator of translations in x-space. Now we must enlarge the concept 


of space-time to include the supersymmetric partner of the x-coordinate. Let 
us define superspace as the space created by the pair 


Xu» Oy, (A.4.13) 
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where 6, is a Grassmann number. Let us define the supersymmetry generator 
a 
— eee eae fod 
Op = ae Oi) ae (A.4.14) 
where @ is a Grassmann number. We choose this particular representation be- 


cause the anticommutator between two such generators yields a displacement, 
as it should, 


{Qu, Oe} = —2(y" Chagi dn. (A.4.15) 
Notice that €Q makes the following transformations on superspace 
Ke eye 
Oy —> Oy + Ea- (A.4.16) 


Notice also that we can construct the operator 


a) 
Dz = oes + i(y"*O,0)w. (A.4.17) 
This anticommutes with the supersymmetry generator 
{Qa, Dg} = 0. (A.4.18) 


This is very important because it allows us to place restrictions on the repre- 
sentations of supersymmetry without destroying the symmetry. This permits 
us to extract the irreducible representations from the reducible ones. 

Let us now try to construct invariant actions under supersymmetry. Let us 
define the superfield V as the most general power expansion in this superspace 


Vie 0}, (A.4.19) 
Then a representation of supersymmetry is given by 
bV(x, 9) = V(x + 6x, 0 + 60) — VG, 6) = &° 0, V(x, 4). (A.4.20) 


Notice that this definition proves that the product of two superfields is also a 
superfield 


V, V2 = V3. 


Thus, we can construct a large set of representations of supersymmetry by this 
simple product rule. Now let us calculate the explicit transformation of the 
fields. We will sometimes find it useful to break up the four-component spinor 
into two two-component spinors, because of the identity: 


O(4) = SU(2) @ SU(2). (A.4.21) 


Using indices A and A, A = 1,2, we will write a Majorana four-spinor in 
terms of its SU(2) x SU(2) content: 


Oe) (A.4,22) 
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If we invert this, we find 


x4 =h(l+ys)x, 


Xa = 50 —-ys)x, (A.4.23) 
and 
Ce ( 0 ) 
Oe? 
Esp = 649 = —2,, — 648, 2 = 1. (A.4.24) 


In this notation, the covariant derivatives can be written as 


0 : 
Da = — — i(o")44042,, 


~ 964 
0 ; 
Dj = Pa a iG 0 on (A.4.25) 
where 
o” = (1,0). (A.4.26) 


The real vector superfield V can be decomposed as 
V(x, 6,6) =C — idx —ix'6 — 4i0°(M —iN) + 4i6°(M +iN) — 00%6A, 
+ i0?0(A — Si px’) — i600’ — ti px) 
— 16°6°(D + 30C). (A.4.27) 


We can now read off the supersymmetry transformation parametrized by ¢ on 
these 16 fields: 


5 C= Gabe 

5x = (M + YsN)E = iy"(Ay te YsO,C)o, 

6M =¢(A —i gx), 

ON = Cys(A —i px), (A.4.28) 
bAyp = il pA+ COX, 

6A = —io""S8, Ay — ysoD, 

6D = —it pysd. 

We call it a vector superfield because it contains a vector particle in its rep- 
resentation (not because the superfield itself is a vector field under the Lorentz 
group). In general, vector fields can be complex and they are reducible. To form 
irreducible representations, we will find it convenient to place constraints on 
them that do not destroy their supersymmetric nature. The constraints must 


therefore commute with the supersymmetry generator. 
Notice that, because D, anticommutes with the supersymmetry generator, 
we can impose this derivative on a superfield and still get a representation 
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of supersymmetry. Let us now try to construct various representations of 
supersymmetry based on this simple principle. We can impose 


Dig =O: (A.4.29) 
A superfield that satisfies this constraint is called a chiral superfield. Notice 


that it has half the number of fields of the original superfield but that it still 
transforms under the group correctly. A chiral superfield has the decomposition 


o(x,0)=A+26w — 60°F. (A.4.30) 
The variation of this superfield can also be read off the variation 
5 = —i[, 0 + OF]. (A.4.31) 
We easily obtain 
bA = 26, 
dy = —CF —id,Ao"e, (A.4.32) 


dF = —2i0, po". 
We could also try other combinations of constraints, such as 
Dad =0 (A.4.33) 


on a chiral superfield. We find, however, that the combination of the two 
constraints imposed simultaneously implies that @ is a constant. 
Another constraint might be 


D“D,d = 0. (A.4.34) 


This yields the linear multiplet. (Unfortunately, actions based on this are usu- 
ally equivalent to actions based on chiral superfields, so we learn nothing new.) 
Another constraint might be 


[D,, Dg]d = 0. (A.4.35) 
Unfortunately, this yields a constant field. We might also impose 
D;D° Did 0. (A.4.36) 
This again gets us back to the chiral superfield. Finally, we might also try 
D,D’¢ =0 (A.4.37) 


for real ¢. This actually yields an entirely new superfield, which we will use 
to build the Yang—Mills action. 

In summary, the only new ficlds that transform as irreducible representations 
of supersymmetry are the chiral supertield, the vector superfield, and the Yang— 
Mills superfield. The other combinations that we might try are either empty or 
redundant to the original set. 

Now Ict us discuss the problem of forming an invariant action by defining 
Grassmann integration. 
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Integration over these Grassmann variables must be carefully defined. 
Ordinary integration over real numbers, of course, is translation-invariant: 


iL. dx¢(x) = f- dxo(x +c), (A.4.38) 


co 


where c is a real displacement. We would like Grassmann-valued integration 
to have the same property 


i d0o() = / ddgio+c). -* (A.4,39) 


If we power expand this function g(@) in a Taylor series, we have the simple 
expression 


o(0)=a+t+bé. (A.4.40) 
If we define 
ne | dd, 
i | doe, (A.4.41) 
then translation invariance forces us to have 
| d6¢(0) =alt+ bh =(at+ be) + bh. (A.4.42) 
Thus, we must have /) = 0 and /; can be normalized to one: 
I=, 
I, =1. (A.4.43) 
or - 
[« — 0, [ 400 = Il. (A.4.44) 
In other words, we have the curious-looking identity 
a 
é=—. A.4.45 
foo3 ses 


With these identities, we can show that 


N N 
/ | | 46; 46; exp bs fn, = det(A;;). (A.4.46) 


i=l ij=l 


Thus, in general, invariant actions can be formed (see (A.4.27), (A.4.30)) 


/ d*6 d*xV(x, 6) — D-term, 


| d’6 d*xo(x,0) > F-term. (A.4.47) 
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The first integral only selects out the D-term of the superfield. The second 
integral only selects out the F-term of the chiral superfield. In general, we call 
these “F” and “D” terms. We can check that these are invariant actions: 


5 i axV = / d’xée°Q,V =0. (A.4.48) 


This is because the integral of a total derivative, in either x or 6 space, 1s zero. 
Now let us try to write simple invariant actions based on F and D invariant 
terms. The simplest invariant action is called the Wess—Zumino model: 


Z seaeeaa: 
= [ @xo0 + (/ ax Ei + img? + =o | + he.) . (A.4.49) 
Written out in components after performing the @ integration, it contains 
S= [as {—4(0, AY — $(0,BY — 4Xy"O.x +4F° +4G°}. (A.4.50) 


Notice that we have now constructed an action with an irreducible representa- 
tion of supersymmetry with a spin-0 and spin-+ multiplet: (+. 0). To construct 
the (1, 4) multiplet, we need the following construction for the Maxwell action, 
given by 


c= i d*x d’OW*Wa, (A.4.51) 
where 
Wa = D’DaV, 
W, = D?D,V, (A.4.52) 
D, We = 0, 
where V is a real vector supermultiplet, which transforms as 
bV=A-A, (A.4.53) 
V is real, but A is chiral: At = —A. Under this transformation, we find that 
OW, =O: (A.4.54) 


so the action is trivially invariant under both supersymmetry and U(1) gauge 
invariance. Notice that the vector supermultiplet contains the Maxwell field 
A,,, while the chiral supermultiplet A contains the gauge parameter A. Written 
out in components, this equals 


S= fas (—} Fi, — tVv"a,v + 4D’) (A.4.55) 
which is invariant under 
6A, = eyrV, 
by = (—40"" Fyy + ysDe, (A.4.56) 


ay 8 Eysy"d,W. 
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The next multiplet we wish to investigate is the (2, 3) multiplet. Histor- 
ically, it was thought that the Rarita-Schwinger theory was fundamentally 
flawed because it permitted no consistent couplings to other fields. However, 
physicists neglected to couple the Rarita-Schwinger field with the graviton. 
The inconsistencies all disappear for this multiplet. 


A.5_ A Brief Introduction to Supergravity 


There are at least four ways to formulate supergravity: 


(1) Components—this method relies heavily on trial and error. However, it 
yields the most explicit form of the action. 

(2) Curvatures—this method stresses the group theory and the analogy with 
Yang—Mills theory. 

(3) Tensor calculus—this yields precise rules for the multiplication of 
representations of supersymmetry. 

(4) Superspace—this is the most elegant formulation of supergravity; how- 
ever, it is also the most difficult. The higher NV superspace formulations 
of supergravity are still not known because the torsion constraints are too 
difficult to solve. 


We will concentrate on the method of curvatures because it resembles the 
Yang—Mills construction we have been using so far. 

Since Osp(1/4) has 14 generators, let us define the 14 connection fields of 
Osp(1/4) as 


A a ab wa 
hi, = (ur On > Wy) (ArS.1) 
Then the global variation of the connection fields is 
She = face 4, (A.5.2) 
where 
6) (676%. 67), (A.5.3) 


The covariant derivative is now given by 
Vu = 9, +hiM, 
Sop ter, Mapt Via  (@>b). (A54) 
The fields transform under a local gauge transformation as 
bh’ = d,e4 + hie” fez. (A.5.5) 
Now form the commutator of two covariant derivatives 


iVieeey lai M 4, (A.5.6) 
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where 
A A BpC fA 
RA, = 04h) — dhe + ACh, fcp- (A537) 
In component form, we have 
CeO 76, oe we —(uw<v), 
RUM) = 0,05" + war — (u <> v), (A.5.8) 
RE (0) Son, + eo — (ay): 
The variation of the curvature can easily be shown to be 
OR he fons (A.5.9) 


The action for supergravity is now given by 


S= fatxe0 (Ryn) Ryn MY ats Rial OF i OV POE Vig). 


(A.5.10) 
If we make a variation of this action, we find that the action is not totally 
invariant unless we set 


Rie) =o, (A.5.11) 
The action is invariant up to the term 
Om ok, AP). CA5Sa12) 


However, since we imposed this constraint (A.S.11) from the beginning, we 
find that the action, indeed, is fully invariant under the transformation. 

This constraint appears to be highly unnatural until we realize that it is the 
same as the vanishing of the covariant derivative of the vierbein in (A.2.31). 
Thus, we will choose the vierbein to have zero derivative in order to have the 
final invariant of the action. 

The final action is 


L= —=9eR — Ww? Dobe”. (A.5.13) 

Unfortunately, the situation for higher N actions is much less clear. The 
superspace method still has not been solved for the higher NV, but the V = 8 
supergravity has been constructed using a trick: by extending the dimension 
of spacetime to 11, we can construct N = 1. D = 11 supergravity. Then we 
compactify in order to reduce down to N = 8, D = 4 supergravity. 

The starting point for constructing the |1-dimensional supergravity is to 
realize that we need equal numbers of bosonic and fermionic fields. By trial 
and error, we sce that the following choices yield equal numbers of fields: 


ey = 59 x 0 — 1 = 44 components, 


Wo 


$(9 x 32 — 32) = 128 components, 
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9 
ANNE = ( -) (A.5.14) 


= 84 components, 


where the vierbein is transverse and traceless and [ - yw = 0. Then, by brute 
force, Cremmer, Julia, and Scherk proved that the following action is invariant 
in 11 dimensions: 


l = 
L= Soe as — sew ml? Dui (w+ &)]vp — we unro 
_ V2« 


384 —— (Wy PMNPORS ye 4 12 PPO yk (eer PF eor 


we CEST Pemee Or re rae (A.5.15) 
where 
bey = sKnl “Wm, 
dAunpe = eer ae (A.5.16) 
bWu = «'Du(O)n + page 857,02" \nF pors, 
and where 


OmaB = Oma + tw Tpmasow®, . (A.5.17) 


where Fyn pq is the curl of Ayyp and F mnpg is the supercovariantization of 
Fun pg, where we choose {f'“, 72} = 24°, we antisymmetrize according to 
ra? — 5(r4r? — F814), and indices ABC are flat and MN P are curved. 
The N = 1, D = 10 supergravity can be found by truncating the earlier 
action. The spinor decomposes nse a pair of Majorana—Wey] gravitinos and 
a pair of Majorana—Weyl spin-+ fermions. The vierbein decomposes into a 
10-dimensional vierbein and a scalar field #, while the antisymmetric tensor 
field decomposes into a boson field By. Thus, the set of reduced fields is 


{eA;6;Bun}, {Wasa}. (A.5.18) 


Our final action in 10 dimensions is 


e'L=-z5R-hn pee’ Dia a A olmasd ? Faye 
Ps 3 
— APM Dua — 7 (g"'8 uP) — —— Wa TMG v9) 
J2K i : 
A eo Ave oh eR + 6p Thy? 


Bey ay oy eee, (A.5.19) 
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where 

A _ Keng 
J2K 

6p = oa 

RE) 7 " ve 
OByn = age ne uwn = nnn = “a Tae ’ (A.5.20) 

3 

dA = Boel TC -8¢)n+-— go TMP Haw oes, 


/f2 
8Wm = «| Dun + —— a? Oo (08 nae 


and H = dB, and where ... means four-fermion Fermi-type terms that we 
are omitting. This is the low-energy limit of a Type IIA string (because the 
fermions have opposite chiralities from the dimensional reduction). Thus, there 
are no anomalies from this theory because there is no chiral asymmetry. 

The Type IIB theory, however, cannot be derived from dimensional reduc- 
tion because it has fermions of the same handedness (and hence can have 
anomalies). Type IIB has no covariant action at all (but has on-shell equations 
of motion and also a well-defined light cone action). 

Next, we would like to couple supergravity to Yang—Mills matter. The super- 
Yang—Mills multiplet by itself is given by 


Ange Kets (A.5.21) 


where a represents the elements of the isospin group. Notice that we have equal 
numbers of fermions and bosons on-shell. The final action is equal to 


eh =e“ Lse(Hunp) —1o 74 Fe Pee — $x°T "(Du (e)x)* 
— gk eX MTN CFA + Fey pM Wae + V2 Md) 
+ ba 2 gp 4 FOTN? y Hyp 
= A V2 Xe x! mea =) (ie Neti 
= sk m “Tunex ar enry — aK x “Tae Ke ae 
(A.5.22) 


where the surprising thing is that we must modify the condition H = dB so 
that 


H = dB -—27 "xq, (A.5.23) 
where 3 is the Chern—-Simons term 


= Tr(AF — igA’). (A.5.24) 
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The variation of B under a gauge transformation is now 
8B =27'"*« Tr(AdA), 
Of = 0: (AG5225) 
The action is invariant under 
SAG, = 39° HT ax’, 
ae {3Ax°n (A.5.26) 
— 300% XP — GANS y*Pyypon). 
The transformations of the supergravity fields are the same as before (with 


modified H field) and the new pieces that must be added to the transformation 
law are 


a ig BPMN Fra n+ 


aan 
dA = Te x“ unpn, 


8'Bun = 27 een (A.5.27) 
BW = — ag k OT Pox *(Puneo — SgunU po). 


We may ask the obvious question: Are there supergravity theories beyond 
N = 8? The answer is probably no. dus is because the supersymmetric 
a of the Osp(N /4) group have spin-}. If we take the maximum helicity 
state of the graviton and then operate on it with all possible Q’s, we find that 
the series eventually must terminate or else we create particles of spin-5 and 3: 


QuQp-+- QO: |graviton) . , (A.5.28) 


Theories of spin-3 and 3, although they can be constructed as free theories, are 
thought to be inconsistent when coupled to other particles. Thus, since there 
are eight half-steps between 2 and —2, we find that N can only equal eight 
in the above series. Thus, O(8) is the largest supergravity theory that does not 
have spins greater than 2. 


A.6 Notation 


For the purpose of clarity, we have deliberately dropped the normalization 
factors appearing in the functional path integrals and the N-point amplitude 
An. This should not be a problem because they can be easily restored by the 
reader. 
In our units, the Planck length and Planck mass are equal to 
= [hAGc~>] = 1.6 x 10° cm, 
= [AcG~'] = 2.2 x 1075 gm = 1.2 x 10'° GeV/c’. 


“We set c andh = 1. 
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We use the spacetime Lorentz metric (—, +, +, ..., +). Onthe world sheet, 
we use the two-dimensional metric (—, +), where the first (second) index refers 
to t (o). Our two-dimensional matrices are 


\ i _ (0 i 
PN NG 


v= ve. 

In general, we will use k,, to represent the physical momentum and p,, to 
represent the operator whose eigenvalue is the momentum. However, as in the 
literature, we will often drop this distinction and use both interchangeably. 

We use the following choices for the Regge slope 


where 


/ 


a 


] . 
> for open strings, 


fe 


a’ = + for closed strings. 


Whenever possible, we use Greek letters to denote curved space indices. 
We use yz and v to denote 26- and 10-dimensional Lorentz indices in curved 
space. We use Greek letters, a, 6 to denote two-dimensional curved world 
sheet indices. Flat space indices, either in Lorentz or in two-space, are usually 
in Roman letters, a, b, c. When the context is clear, we will sometimes use the 
metric (+,+,..., +) for the flat tangent space. 

We choose our gamma matrices such that 

Pipe — oly eed nae 


In an even number of dimensions, they are normalized so that 
(pu) =+1 if D=4k+2, 
(Mou) =-1 if D=4k. 


When the gamma matrices appear with more than one index. we take the sum 
of all antisymmetric combinations of these indices. For example, we normalize 
the gamma matrices so that 


p48 = rT Bs Bled ea Big (a 
We define the light cone gamma matrices as 
| 
J2 


| 
(po _ po-i 
= ae. 
When specializing to the case of 10 or 4 dimensions, we will often just use the 
symbol y*. 


i (ea rey, 


r- 


I 
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When we make the reduction down to SO(8) for the light cone formalism, 
we will use the direct products of 2 x 2 Pauli matrices: 


Y = 1) Om, 
y =i18%71®n, 
Y =110%3@h, 
y* =i @H@l, 
Pai, ®t Od, 
Y im @1@q, 
Y im ® 1 QT, 
y>=1@1@1, 
Van = 3 ViaVin — YoaVin)> 
where t; are the Pauli matrices, and each gamma matrix is a direct product of 
three 2 x 2 blocks. 
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Superstrings and M-theory are provocative and controversial, but 
unarguably among the most interesting and active areas of research 
in current physics. Called by some “the theory of everything,” super- 
strings may solve a problem that has eluded physicists for the past 50 
years, the final unification of the two great theories of the twentieth 
century, general relativity and quantum field theory. 


Now, here is a thoroughly revised, second edition of a course-tested 
comprehensive introductory graduate text on superstrings which 
stresses the most current areas of interest, not covered in other pre- 
sentations, including: 


¢ Four-dimensional superstrings 

¢ Kac-Moody algebras 

e Teichmiuller spaces and Calabi-Yau manifolds 
e¢ M-theory membranes and D-branes 

e Duality and BPS relations 

e Matrix models 


The book begins with a simple discussion of point particle theory 
and uses Feynman path integrals to unify the presentation of super- 
strings. It has been updated throughout, and three new chapters on 
M-theory have been added. 


Prerequisites are an acquaintance with quantum mechanics and rel- 
ativity. 
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